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(57) Abstract 

Compositions and methods for the detection and therapy of breast cancer are disclosed. The compounds provided include nucleotide 
sequences that are preferentially expressed in breast tumor tissue, as well as polypeptides encoded by such nucleotide sequences. Vaccines 
and pharmaceutical compositions comprising such compounds are also provided and may be used, for example, for the prevention and 
treatment of breast cancer. The polypeptides may also be used for the production of antibodies, which are useful for diagnosing and 
monitoring the progression of breast cancer in a patient. 
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Desca-iption 

COMPOSITIONS AND METHODS FOR THE TREATMENT 
AND DIAGNOSIS OF BREAST CANCER 

5 

Technical Field 

The present invention relates generally to the detection and therapy of 
breast cancer. The invention is more specifically related to nucleotide sequences that 
are preferentially expressed in breast tumor tissue and to polypeptides encoded by such 
10 nucleotide sequences. The nucleotide sequences and polypeptides may be used in 
vaccines and pharmaceutical compositions for the prevention and treatment of breast 
cancer. The polypeptides may also be used for the production of compounds, such as 
antibodies, usefiil for diagnosing and monitoring the progression of breast cancer in a 
patient. 

15 

Background of the Invention 

Breast cancer is a significant health problem for women in the United 
States and throughout the world. Although advances have been made in detection and 
treatment of the disease, breast cancer remains the second leading cause of cancer- 
20 related deaths in women, affecting more than 1 80,000 women in the United States each 
year. For women in North America, the life-time odds of getting breast cancer are now 
one in eight. 

No vaccine or other universally successfiil method for the prevention or 
treatment of breast cancer is currently available. Management of the disease currently 

25 relies on a combination of early diagnosis (through routine breast screening procedures) 
and aggressive treatment, which may include one or more of a variety of treatments 
such as surgery, radiotherapy, chemotherapy and hormone therapy. The course of 
treatment for a particular breast cancer is often selected based on a variety of prognostic 
parameters, including an analysis of specific tumor markers. See, e.g., Porter-Jordan 

30 and Lippman, Breast Cancer 5:73-100 (1994). However, the use of established 
markers often leads to a result that is difficult to interpret, and the high mortality 
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observed in breast cancer patients indicates that improvements are needed in the 
treatment, diagnosis and prevention of the disease. 

Accordingly, there is a need in the art for improved methods for therapy 
and diagnosis of breast cancer. The present invention fiilfills these needs and further 
5 provides other related advantages. 

Summary of the Invention 

Briefly stated, the subject invention provides compositions and methods 
for the diagnosis and therapy of breast cancer. In one aspect, isolated DNA molecules 

10 are provided, comprising (a) a nucleotide sequence preferentially expressed in breast 
cancer tissue, relative to normal tissue; (b) a variant of such a sequence that contains 
one or more nucleotide substitutions, deletions, insertions and/or modifications at no 
more than 20% (preferably no more than 5%) of the nucleotide positions, such that the 
antigenic and/or immunogenic properties of the polypeptide encoded by the nucleotide 

15 sequence are retained; or (c) a nucleotide sequence encoding an epitope of a 
polypeptide encoded by at least one of the above sequences. In one embodiment, the 
isolated DNA molecule comprises a human endogenous retroviral sequence recited in 
SEQ ID N0:1. In other embodiments, the isolated DNA molecule comprises a 
nucleotide sequence recited in any one of SEQ ID N0:3 - SEQ ID NO:77 or SEQ ID 

20 NOS: 142, 143, 146-152, 154-166, 168-176, 178-192, 194-198, 200-204, 206, 207, 
209-214,216,218,219, 221-227. 

In related embodiments, the isolated DNA molecule encodes an epitope 
of a polypeptide, wherein the polypeptide is encoded by a nucleotide sequence that: (a) 
hybridizes to a sequence recited in any one of SEQ ID N0:1 or SEQ ID N0:3 - SEQ ID 

25 NO:77 or SEQ ID NOS: 142, 143, 146-152, 154-166, 168-176, 178-192, 194-198, 200- 
204, 206, 207, 209-214, 216, 218, 219, 221-227 under stringent conditions; and (b) is at 
least 80% identical to a sequence recited in any one of SEQ ID N0:1 or SEQ ID N0:3 - 
SEQ ID NO:77 or SEQ ID NOS: 142, 143, 146-152, 154-166, 168-176, 178-192, 194- 
198, 200-204, 206, 207, 209-214, 216, 218, 219, 221-227; and wherein RNA 

30 corresponding to said nucleotide sequence is expressed at a greater level in human 
breast tumor tissue than in normal breast tissue. 
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In another embodiment, the present invention provides an isolated DNA 
molecule encoding an epitope of a polypeptide, the polypeptide being encoded by: (a) a 
nucleotide sequence transcribed from the sequence of SEQ ID NO: 141 ; or (b) a variant 
of said nucleotide sequence that contains one or more nucleotide substitutions, 
5 deletions, insertions and/or modifications at no more than 20% of the nucleotide 
positions, such that the antigenic and/or immunogenic properties of the polypeptide 
encoded by the nucleotide sequence are retained. Isolated DNA and RNA molecules 
comprising a nucleotide sequence complementary to a DNA molecule as described 
above are also provided. 

^" related aspects, the present invention provides recombinant 
expression vectors comprising a DNA molecule as described above and host cells 
transformed or transfected with such expression vectors. 

In fiirther aspects, polypeptides, comprising an amino acid sequence 
encoded by a DNA molecule as described above, and monoclonal antibodies that bind 
1 5 to such polypeptides are provided. 

In yet another aspect, methods are provided for determining the presence 
of breast cancer in a patient. In one embodiment, the method comprises detecting, 
within a biological sample, a polypeptide as described above. In another embodiment, 
the method comprises detecting, witiiin a biological sample, an RNA molecule 
20 encoding a polypeptide as described above. In yet another embodiment, the method 
comprises (a) intradermally injecting a patient with a polypeptide as described above; 
and (b) detecting an immune response on the patient's skin and tiierefrom detecting the 
presence of breast cancer in the patient. In further embodiments, the present invention 
provides methods for determining the presence of breast cancer in a patient as described 
25 above wherein tiie polypeptide is encoded by a nucleotide sequence selected from the 
group consisting of SEQ ID NO.: 78-86, SEQ ID NOS: 144, 145, 153, 167, 177, 193, 
199, 205, 208, 215. 217, 220 and sequences that hybridize thereto under stringent 
conditions. 

In a related aspect, diagnostic kits useful in the determination of breast 
30 cancer are provided. The diagnostic kits generally comprise either one or more 
monoclonal antibodies as described above, or one or more monoclonal antibodies that 
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bind to a polypeptide encoded by a nucleotide sequence selected from the group 
consisting of sequences provided in SEQ ID NOS: 78-86 and SEQ ID NOS: 144, 145, 
153, 167, 177, 193, 199, 205, 208, 215, 217, 220, and a detection reagent. 

Within a related aspect, the diagnostic kit comprises a first polymerase 

5 chain reaction primer and a second polymerase chain reaction primer, the first and 
second primers each comprising at least about 10 contiguous nucleotides of an RNA 
molecule as described above, or an RNA molecule encoding a polypeptide encoded by 
a nucleotide sequence selected from the group consisting of SEQ ID NOS: 78-86 and 
SEQ ID NOS: 144, 145, 153, 167, 177, 193, 199,205,208,215,217,220. 

10 Within another related aspect, the diagnostic kit comprises at least one 

oligonucleotide probe, the probe comprising at least about 15 contiguous nucleotides of 
a DNA molecule as described above, or a DNA molecule selected from the group 
consisting of SEQ ID NOS: 78-86 and SEQ ID NOS: 144, 145, 153, 167, 177, 193, 
199, 205, 208,215,217, 220. 

15 In another related aspect, the present invention provides methods for 

monitoring the progression of breast cancer in a patient. In one embodiment, the 
method comprises: (a) detecting an amount, in a biological sample, of a polypeptide as 
described above at a first point in time; (b) repeating step (a) at a subsequent point in 
time; and (c) comparing the amounts of polypeptide detected in steps (a) and (b), and 

20 therefrom monitoring the progression of breast cancer in the patient. In another 
embodiment, the method comprises (a) detecting an amount, within a biological 
sample, of an RNA molecule encoding a polypeptide as described above at a first point 
in time; (b) repeating step (a) at a subsequent point in time; and (c) comparing the 
amounts of RNA molecules detected in steps (a) and (b), and therefrom monitoring the 

25 progression of breast cancer in the patient. In yet other embodiments, the present 
invention provides methods for monitoring the progression of breast cancer in a patient 
as described above wherein the polypeptide is encoded by a nucleotide sequence 
selected form the group consisting of SEQ ID NO.: 78-86, SEQ ID NOS:144, 145, 153, 
167, 177, 193, 199, 205, 208, 215, 217, 220 and sequences that hybridize thereto under 

30 stringent conditions. 
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In still other aspects, pharmaceutical compositions, which comprise a 
polypeptide as described above in combination with a physiologically acceptable 
carrier, and vaccines, which comprise a polypeptide as described above in combination 
with an immune response enhancer or adjuvant are provided. In yet other aspects, the 
5 present invention provides pharmaceutical compositions and vaccines comprising a 
polypeptide encoded by a nucleotide sequence selected form the group consisting of 
SEQ ID NO.: 78-86, SEQ ID NOS:144, 145, 153, 167, 177, 193, 199, 205, 208, 215, 
217, 220 and sequences that hybridize thereto under stringent conditions. 

In related aspects, the present invention provides methods for inhibiting 
10 the development of breast cancer in a patient, comprising administering to a patient a 
pharmaceutical composition or vaccine as described above. 

These and other aspects of the present invention will become apparent 
upon reference to the following detailed description and attached drawings. All 
15 references disclosed herein are hereby incorporated by reference in their entirety as if 
each was incorporated individually. 

Brief Description of the Drawings 

Figure 1 shows the differential display PGR products, separated by gel 
20 electrophoresis, obtained from cDNA prepared from normal breast tissue (lanes 1 and 

2) and from cDNA prepared from breast tumor tissue from the same patient (lanes 3 

and 4). The arrow indicates the band corresponding to Bl 8Agl . 

Figure 2 is a northern blot comparing the level of B18Agl mRNA in 

breast tumor tissue (lane 1) vAth the level in normal breast tissue. 
25 Figure 3 shows the level of B18Agl mRNA in breast tumor tissue 

compared to that in various normal and non-breast tumor tissues as determined by 

RNase protection assays. 

Figure 4 is a genomic clone map showing the location of additional 
retroviral sequences obtained from ends of Xbal restriction digests (provided in SEQ ID 
30 N0:3 - SEQ ID NO:10) relative to B18Agl. 
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Figures 5A and 5B show the sequencing strategy, genomic organization 
and predicted open reading frame for the retroviral element containing Bl 8Agl . 

Figure 6 shows the nucleotide sequence of the representative breast 
tumor-specific cDNA Bl 8Agl . 
5 Figure 7 shows the nucleotide sequence of the representative breast 

tumor-specific cDNA B17Agl. 

Figure 8 shows the nucleotide sequence of the representative breast 
tumor-specific cDNA B17Ag2. 

Figure 9 shows the nucleotide sequence of the representative breast 
10 tumor-specific cDNA B 1 3 Ag2a. 

Figure 10 shows the nucleotide sequence of the representative breast 
tumor-specific cDNA B13Aglb. 

Figure 1 1 shows the nucleotide sequence of the representative breast 
tumor-specific cDNA B13Agla. 
15 Figure 12 shows the nucleotide sequence of the representative breast 

tumor-specific cDNA Bl 1 Agl. 

Figure 13 shows the nucleotide sequence of the representative breast 
tumor-specific cDNA B3CA3c. 

Figure 14 shows the nucleotide sequence of the representative breast 
20 tumor-specific cDN A B9CG 1 . 

Figure 15 shows the nucleotide sequence of the representative breast 
tumor-specific cDNA B9CG3. 

Figure 16 shows the nucleotide sequence of the representative breast 
tumor-specific cDNA B2CA2. 
25 Figure 17 shows the nucleotide sequence of the representative breast 

tumor-specific cDNA B3CA1. 

Figure 18 shows the nucleotide sequence of the representative breast 
tumor-specific cDNA B3CA2. 

Figure 19 shows the nucleotide sequence of the representative breast 
30 tumor-specific cDNA B3CA3. 
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Figure 20 shows the nucleotide sequence of the representative breast 
tumor-specific cDNA B4CA1. 

Figure 21 A depicts RT-PCR analysis of breast tumor genes in breast 
tumor tissues (lanes 1-8) and normal breast tissues (lanes 9-13) and H2O (lane 14). 
5 Figure 21B depicts RT-PCR analysis of breast tumor genes in prostate 

tumors (lane 1,2), colon tumors (lane 3), lung tumor (lane 4), normal prostate (lane 5), 
normal colon (lane 6), normal kidney (lane 7), normal liver (lane 8), normal lung (lane 
9), normal ovary (lanes 10, 18), normal pancreases (lanes 11, 12), normal skeletal 
muscle (lane 13), normal skin (lane 14), nomal stomach (lane 15), norma! testes (lane 
10 16), normal small intestine (lane 17), HBL-100 (lane 19), MCF-12A (lane 20), breast 
tumors (lanes 21-23), Hp (lane 24), and colon tumor (lane 25). 

Detailed Description of the Invention 

As noted above, the present invention is generally directed to 
1 5 compositions and methods for the diagnosis, monitoring and therapy of breast cancer. 
The compositions described herein include polypeptides, nucleic acid sequences and 
antibodies. Polypeptides of the present invention generally comprise at least a portion 
of a protein that is expressed at a greater level in human breast tumor tissue than in 
normal breast tissue (i.e., the level of RNA encoding the polypeptide is at least 2-fold 
20 higher in tumor tissue). Such polypeptides are referred to herein as breast tumor- 
specific polypeptides, and cDNA molecules encoding such polypeptides are referred to 
as breast tumor-specific cDNAs. Nucleic acid sequences of the subject invention 
generally comprise a DNA or RNA sequence that encodes all or a portion of a 
polypeptide as described above, or that is complementary to such a sequence. 
25 Antibodies are generally immune system proteins, or fragments thereof, that are 
capable of binding to a portion of a polypeptide as described above. Antibodies can be 
produced by cell culture techniques, including the generation of monoclonal antibodies 
as described herein, or via transfection of antibody genes into suitable bacterial or 
mammalian cell hosts, in order to allow for the production of recombinant antibodies. 
30 Polypeptides within the scope of this invention include, but are not 

limited to, polypeptides (and epitopes thereof) encoded by a human endogenous 
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retroviral sequence, such as the sequence designated BlSAgl (Figure 5 and SEQ ID 
N0:1). Also within the scope of the present invention are polypeptides encoded by 
other sequences within the retroviral genome containing BlSAgl (SEQ ID NO: 141). 
Such sequences include, but are not limited to, the sequences recited in SEQ ID N0:3 - 

5 SEQ ID NO: 10. B 1 8Agl has homology to the gag p30 gene of the endogenous human 
retroviral element S71, as described in Werner et ah. Virology 774:225-238 (1990) and 
also shows homology to about thirty other retroviral gag genes. As discussed in more 
detail below, the present invention also includes a number of additional breast tumor- 
specific polypeptides, such as those encoded by the nucleotide sequences recited in 

10 SEQIDN0:11 - SEQ ID NO:77 and SEQ ID NOS:142, 143, 146-152, 154-166, 168- 
176, 178-192, 194-198, 200-204, 206, 207, 209-214, 216, 218, 219, 221-227. As used 
herein, the term "polypeptide" encompasses amino acid chains of any length, including 
full length proteins containing the sequences recited herein. A polypeptide comprising 
an epitope of a protein containing a sequence as described herein may consist entirely 

15 of the epitope, or may contain additional sequences. The additional sequences may be 
derived from the native protein or may be heterologous, and such sequences may (but 
need not) possess inununogenic or antigenic properties. 

An ''epitope," as used herein is a portion of a polypeptide that is 
recognized (Le., specifically bound) by a B-cell and/or T-cell surface antigen receptor. 

20 Epitopes may generally be identified using well knov^ techniques, such as those 
summarized in Paul, Fundamental Immunology, 3rd ed., 243-247 (Raven Press, 1993) 
and references cited therein. Such techniques include screening polypeptides derived 
fi-om the native polypeptide for the ability to react with antigen-specific antisera and/or 
T-cell lines or clones. An epitope of a polypeptide is a portion that reacts with such 

25 antisera and/or T-cells at a level that is similar to the reactivity of the full length 
polypeptide (e.g., in an ELISA and/or T-cell reactivity assay). Such screens may 
generally be performed using methods well known to those of ordinary skill in the art, 
such as those described in Harlow and Lane, Antibodies: A Laboratory Manual, Cold 
Spring Harbor Laboratory, 1988. B-cell and T-cell epitopes may also be predicted via 

30 computer analysis. Polypeptides comprising an epitope of a polypeptide that is 
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preferentially expressed in a tumor tissue (with or without additional amino acid 
sequence) are within the scope of the present invention. 

The compositions and methods of the present invention also encompass 
variants of the above polypeptides and nucleic acid sequences encoding such 
5 polypeptides. A polypeptide 'Variant/' as used herein, is a polypeptide that differs from 
the native polypeptide in substitutions and/or modifications, such that the antigenic 
and/or immxmogenic properties of the polypeptide are retained. Such variants may 
generally be identified by modifying one of the above polypeptide sequences and 
evaluating the reactivity of the modified polypeptide with antisera and/or T-cells as 

10 described above. Nucleic acid variants may contain one or more substitutions, 
deletions, insertions and/or modifications such that the antigenic and/or immunogenic 
properties of the encoded polypeptide are retained. One preferred variant of the 
polypeptides described herein is a variant that contains nucleotide substitutions, 
deletions, insertions and/or modifications at no more than 20% of the nucleotide 

1 S positions. 

Preferably, a variant contains conservative substitutions. A 
"conservative substitution" is one in which an amino acid is substituted for another 
amino acid that has similar properties, such that one skilled in the art of peptide 
chemistry would expect the secondary structure and hydropathic nature of the 
20 polypeptide to be substantially unchanged. In general, the following groups of amino 
acids represent conservative changes: (l)ala, pro, gly, glu, asp, gin, asn, ser, thr; 
(2) cys, ser, tyr, thr; (3) val, ile, leu, met, ala, phe; (4) lys, arg, his; and (5) phe, tyr, trp, 
his. 

Variants may also (or alternatively) be modified by, for example, the 
25 deletion or addition of amino acids that have minimal influence on the immunogenic or 
antigenic properties, secondary structure and hydropathic nature of the polypeptide. 
For example, a polypeptide may be conjugated to a signal (or leader) sequence at the N- 
terminal end of the protein which co-translationally or post-translationally directs 
transfer of the protein. The polypeptide may also be conjugated to a linker or other 
30 sequence for ease of synthesis, purification or identification of the polypeptide {e.g,, 
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poly-His), or to enhance binding of the polypeptide to a soHd support. For example, a 
polypeptide may be conjugated to an inmiunoglobulin Fc region. 

In general, nucleotide sequences encoding all or a portion of the 
polypeptides described herein may be prepared using any of several techniques. For 

5 example, cDNA molecules encoding such polypeptides may be cloned on the basis of 
the breast tumor-specific expression of the corresponding mRNAs, using differential 
display PGR. This technique compares the amplified products from RNA template 
prepared from normal and breast tumor tissue. cDNA may be prepared by reverse 
transcription of RNA using a (dT),2AG primer. Following amplification of the cDNA 

10 using a random primer, a band corresponding to an amplified product specific to the 
tumor RNA may be cut out from a silver stained gel and subcloned into a suitable 
vector (e g,, the T-vector, Novagen, Madison, WI). Nucleotide sequences encoding all 
or a portion of the breast tumor-specific polypeptides disclosed herein may be amplified 
from cDNA prepared as described above using the random primers shown in SEQ ID 

15 N0.:87-125. 

Alternatively, a gene encoding a polypeptide as described herein (or a 
portion thereof) may be amplified from human genomic DNA, or from breast tumor 
cDNA, via polymerase chain reaction. For this approach, BlSAgl sequence-specific 
primers may be designed based on the sequence provided in SEQ ID N0:1, and may be 

20 purchased or synthesized. One suitable primer pair for amplification from breast tumor 
cDNA is (5'ATG GCT ATT TTC GGG GGC TGA CA) (SEQ ID NO.:126) and 
(5'CCG GTA TCT CCT CGT GGG TAT T) (SEQ ID NO.: 127). An amplified portion 
of BlSAgl may then be used to isolate the full length gene from a human genomic 
DNA library or fit)m a breast tumor cDNA library, using well known techniques, such 

25 as those described in Sambrook et al.. Molecular Cloning: A Laboratory Manual, Cold 
Spring Harbor Laboratories, Cold Spring Harbor, NY (1989). Other sequences within 
the retroviral genome of which BlSAgl is a part may be similarly prepared by 
screening human genomic libraries using BlSAgl -specific sequences as probes. 
Nucleotides translated into protein from the retroviral genome shown in SEQ ID NO: 

30 141 may then be determined by cloning the corresponding cDNAs, predicting the open 
reading frames and cloning the appropriate cDNAs into a vector containing a viral 
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promoter, such as T7. The resulting constructs can be employed in a translation 
reaction, using techniques known to those of skill in the art, to identify nucleotide 
sequences which result in expressed protein. Similarly, primers specific for the 
remaining breast tumor-specific polypeptides described herein may be designed based 
5 on the nucleotide sequences provided in SEQ ID NO: 11 - SEQ ID NO:86 and SEQ ID 
NO:142-SEQ IDNO:226. 

Recombinant polypeptides encoded by the DNA sequences described 
above may be readily prepared firom the DNA sequences. For example, supematants 
firom suitable host/vector systems which secrete recombinant protein or polypeptide 

10 into culture media may be first concentrated using a conunercially available filter. 
Following concentration, the concentrate may be applied to a suitable purification 
matrix such as an affinity matrix or an ion exchange resin. Finally, one or more reverse 
phase HPLC steps can be employed to further purify a recombinant polypeptide. 

In general, any of a variety of expression vectors known to those of 

15 ordinary skill in the art may be employed to express recombinant polypeptides of this 
invention. Expression may be achieved in any appropriate host cell that has been 
transformed or transfected with an expression vector containing a DNA molecule that 
encodes a recombinant polypeptide. Suitable host cells include prokaryotes, yeast and 
higher eukaryotic cells. Preferably, the host cells employed are E. coli, yeast or a 

20 mammalian cell line such as COS or CHO. 

Such techniques may also be used to prepare polypeptides comprising 
epitopes or variants of the native polypeptides. For example, variants of a native 
polypeptide may generally be prepared using standard mutagenesis techniques, such as 
oligonucleotide-directed site-specific mutagenesis, and sections of the DNA sequence 

25 may be removed to permit preparation of truncated polypeptides. Portions and other 
variants having fewer than about 100 amino acids, and generally fewer than about 50 
amino acids, may also be generated by synthetic means, using techniques well known 
to those of ordinary skill in the art. For example, such polypeptides may be synthesized 
using any of the commercially available solid-phase techniques, such as the Merrifield 

30 solid-phase synthesis method, where amino acids are sequentially added to a growing 
amino acid chain. See Menrifield, J. Am. Chem, Soc. 55:2149-2146 (1963). Equipment 
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for automated synthesis of polypeptides is commercially available from suppliers such 
as Applied BioSystems, Inc., Foster City, CA, and may be operated according to the 
manufacturer's instructions. 

In specific embodiments, polypeptides of the present invention 

5 encompass amino acid sequences encoded by a DNA molecule having a sequence 
recited in any one of SEQ ID NO:l or SEQ ID N0:3 - SEQ ID NO:77 or SEQ ID 
NOS:142, 143, 146-152, 154-166, 168-176, 178-192, 194-198,200-204, 206, 207, 209- 
214, 216, 218, 219, 221-227, variants of such polypeptides that are encoded by DNA 
molecules containing one or more nucleotide substitutions, deletions, insertions and/or 

10 modifications at no more than 20% of the nucleotide positions, and epitopes of the 
above polypeptides. Polypeptides within the scope of the present invention also include 
polypeptides (and epitopes thereof) encoded by DNA sequences that hybridize to a 
DNA molecule having a sequence recited in any one of SEQ ID N0:1 or SEQ ID N0:3 
- SEQ ID NO:77 or SEQ ID NOS:142, 143, 146-152, 154-166, 168-176, 178-192, 194- 

15 198, 200-204, 206, 207, 209-214, 216, 218, 219, 221-227 under stringent conditions, 
wherein the DNA sequences are at least 80% identical in overall sequence to a recited 
sequence and wherein RNA corresponding to the nucleotide sequence is expressed at a 
greater level in human breast tumor tissue than in normal breast tissue. As used herein, 
"stringent conditions" refers to prewashing in a solution of 6X SSC, 0.2% SDS; 

20 hybridizing at 65^C, 6X SSC, 0.2% SDS overnight; followed by two washes of 30 
minutes each in IX SSC, 0. 1% SDS at 65*'C and two washes of 30 minutes each in 1 X 
SSC, 0.1% SDS at eS^'C. DNA molecules according to the present invention include 
molecules that encode any of the above polypeptides. 

In another aspect of the present invention, antibodies are provided. Such 

25 antibodies may be prepared by any of a variety of techniques known to those of 
ordinary skill in the art. See, e.g., Harlow and Lane, Antibodies: A Laboratory 
Manual Cold Spring Harbor Laboratory, 1988. In one such technique, an immunogen 
comprising the polypeptide is initially injected into any of a wide variety of mammals 
(e.g., mice, rats, rabbits, sheep or goats). In this step, the polypeptides of this invention 

30 may serve as the immunogen without modification. Alternatively, particularly for 
relatively short polypeptides, a superior immune response may be elicited if the 
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polypeptide is joined to a carrier protein, such as bovine serum albumin or keyhole 
limpet hemocyanin. The inununogen is injected into the animal host, preferably 
according to a predetermined schedule incorporating one or more booster 
immunizations, and the animals are bled periodically. Polyclonal antibodies specific 
5 for the polypeptide may then be purified fix)m such antisera by, for example, affinity 
chromatography using the polypeptide coupled to a suitable solid support. 

Monoclonal antibodies specific for the antigenic polypeptide of interest 
may be prepared, for example, using the technique of Kohler and Milstein, Eur. 1 
Immunol 5:5 11 -5 19 (1976), and improvements thereto. Briefly, these methods involve 
10 the preparation of immortal cell lines capable of producing antibodies having the 
desired specificity {Le,, reactivity with the polypeptide of interest). Such cell lines may 
be produced, for example, from spleen cells obtained fi-om an animal immunized as 
described above. The spleen cells are then inraiortalized by, for example, fiision with a 
myeloma cell fusion partner, preferably one that is syngeneic with the immunized 

15 animal. A variety of fusion techniques may be employed. For example, the spleen 
cells and myeloma cells may be combined with a nonionic detergent for a few minutes 
and then plated at low density on a selective medium that supports the growth of hybrid 
cells, but not myeloma cells. A preferred selection technique uses HAT (hypoxanthine, 
aminopterin, thymidine) selection. After a sufficient time, usually about 1 to 2 weeks, 

20 colonies of hybrids are observed. Single colonies are selected and their culture 
supematants tested for binding activity against the polypeptide. Hybridomas having 
high reactivity and specificity are preferred. 

Monoclonal antibodies may be isolated fi-om the supematants of 
growing hybridoma colonies. In addition, various techniques may be employed to 

25 enhance the yield, such as injection of the hybridoma cell line into the peritoneal cavity 
of a suitable vertebrate host, such as a mouse. Monoclonal antibodies may then be 
harvested fi-om the ascites fluid or the blood. Contaminants may be removed fi-om the 
antibodies by conventional techniques, such as chromatography, gel filtration, 
precipitation, and extraction. The polypeptides of this invention may be used in the 

30 purification process in, for example, an affinity chromatography step. 



wo 97/25426 



PCTAJS97/00485 



14 

Antibodies may be used, for example, in methods for detecting breast 
cancer in a patient. Such methods involve using an antibody to detect the presence or 
absence of a breast tumor-specific polypeptide as described herein in a suitable 
biological sample. As used herein, suitable biological samples include tumor or normal 

S tissue biopsy, mastectomy, blood, lymph node, serum or urine samples, or other tissue, 
homogenate, or extract thereof obtained from a patient. 

There are a variety of assay formats known to those of ordinary skill in 
the art for using an antibody to detect polypeptide markers in a sample. See, e.g., 
Harlow and Lane, Antibodies: A Laboratory Manual, Cold Spring Harbor Laboratory, 

10 1988. For example, the assay may be performed in a Western blot format, wherein a 
protein preparation from the biological sample is submitted to gel electrophoresis, 
transferred to a suitable membrane and allowed to react with the antibody. The 
presence of the antibody on the membrane may then be detected using a suitable 
detection reagent, as described below. 

15 In another embodiment, the assay involves the use of antibody 

immobilized on a solid support to bind to the polypeptide and remove it from the 
remainder of the sample. The bound polypeptide may then be detected using a second 
antibody or reagent that contains a reporter group. Alternatively, a competitive assay 
may be utilized, in which a polypeptide is labeled udth a reporter group and allowed to 

20 bind to the immobilized antibody after incubation of the antibody with the sample. The 
extent to which components of the sample inhibit the binding of the labeled polypeptide 
to the antibody is indicative of the reactivity of the sample with the immobilized 
antibody, and as a result, indicative of the concentration of polypeptide in the sample. 

The solid support may be any material known to those of ordinary skill 

25 in the art to which the antibody may be attached. For example, the solid support may 
be a test well in a microtiter plate or a nitrocellulose filter or other suitable membrane. 
Alternatively, the support may be a bead or disc, such as glass, fiberglass, latex or a 
plastic material such as polystyrene or polyvinylchloride. The support may also be a 
magnetic particle or a fiber optic sensor, such as those disclosed, for example, in U.S. 

30 Patent No. 5,359,681. 
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The antibody may be immobilized on the solid support using a variety of 
techniques known to those in the art, which are amply described in the patent and 
scientific literature. In the context of the present invention, the term "immobilization" 
refers to both noncovalent association, such as adsorption, and covalent attachment 
5 (which may be a direct linkage between the antigen and fimctional groups on the 
support or may be a linkage by way of a cross-linking agent). Immobilization by 
adsorption to a well in a microtiter plate or to a membrane is preferred. In such cases, 
adsorption may be achieved by contacting the antibody, in a suitable buffer, with the 
solid support for a suitable amount of time. The contact time varies with temperature, 

10 but is typically between about 1 hour and 1 day. In general, contacting a well of a 
plastic microtiter plate (such as polystyrene or polyvinylchloride) with an amount of 
antibody ranging from about 10 ng to about 1 ^g, and preferably about 100-200 ng, is 
sufficient to immobilize an adequate amount of polypeptide. 

Covalent attachment of antibody to a solid support may also generally be 

15 achieved by first reacting the support with a bifunctional reagent that will react with 
both the support and a functional group, such as a hydroxyl or amino group, on the 
antibody. For example, the antibody may be covalently attached to supports having an 
appropriate polymer coating using benzoquinone or by condensation of an aldehyde 
group on the support with an amine and an active hydrogen on the binding partner {see, 

20 e.g. , Pierce Immunotechnology Catalog and Handbook ( 1 99 1 ) at A 1 2-A 1 3 ). 

In certain embodiments, the assay for detection of polypeptide in a 
sample is a two-antibody sandwich assay. This assay may be performed by first 
contacting an antibody that has been immobilized on a solid support, commonly the 
well of a microtiter plate, with the biological sample, such that the polypeptide within 

25 the sample are allowed to bind to the inunobilized antibody. Unbound sample is then 
removed from the immobilized polypeptide-antibody complexes and a second antibody 
(containing a reporter group) capable of binding to a different site on the polypeptide is 
added. The amount of second antibody that remains boimd to the solid support is then 
determined using a method appropriate for the specific reporter group, 

30 More specifically, once the antibody is immobilized on the support as 

described above, the remaining protein binding sites on the support are typically 



wo 97/25426 



PCT/US97/00485 



16 

blocked. Any suitable blocking agent known to those of ordinary skill in the art, such 
as bovine serum albumin or Tween 20^ (Sigma Chemical Co., St. Louis, MO). The 
immobilized antibody is then incubated with the sample, and polypeptide is allowed to 
bind to the antibody. The sample may be diluted with a suitable diluent^ such as 

5 phosphate-buffered saline (PBS) prior to incubation. In general, an appropriate contact 
time (/.e., incubation time) is that period of time that is sufficient to detect the presence 
of polypeptide within a sample obtained from an individual with breast cancer. 
Preferably, the contact time is sufficient to achieve a level of binding that is at least 
95% of that achieved at equilibrium between bound and unbound polypeptide. Those 

10 of ordinary skill in the art will recognize that the time necessary to achieve equilibrium 
may be readily determined by assaying the level of binding that occurs over a period of 
time. At room temperature, an incubation time of about 30 minutes is generally 
sufficient. 

Unbound sample may then be removed by washing the solid support 

15 with an appropriate buffer, such as PBS containing 0.1% Tween 20*^^. The second 
antibody, which contains a reporter group, may then be added to the solid support. 
Preferred reporter groups include enzymes (such as horseradish peroxidase), substrates, 
cofactors, inhibitors, dyes, radionuclides, luminescent groups, fluorescent groups and 
biotin. The conjugation of antibody to reporter group may be achieved using standard 

20 methods known to those of ordinary skill in the art. 

The second antibody is then incubated with the immobilized antibody- 
polypeptide complex for an amount of time sufficient to detect the bound polypeptide. 
An appropriate amount of time may generally be determined by assaying the level of 
binding that occurs over a period of time. Unbound second antibody is then removed 

25 and bound second antibody is detected xising the reporter group. The method employed 
for detecting the reporter group depends upon the nature of the reporter group. For 
radioactive groups, scintillation counting or autoradiographic methods are generally 
appropriate. Spectroscopic methods may be used to detect dyes, luminescent groups 
and fluorescent groups. Biotin may be detected using avidin, coupled to a different 

30 reporter group (commonly a radioactive or fluorescent group or an enzyme). Enzyme 
reporter groups may generally be detected by the addition of substrate (generally for a 
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specific period of time), followed by spectroscopic or other analysis of the reaction 
products. 

To determine the presence or absence of breast cancer, the signal 
detected from the reporter group that remains bound to the solid support is generally 
5 compared to a signal that corresponds to a predetermined cut-off value established from 
non-tumor tissue. In one prefeired embodiment, the cut-off value is the average mean 
signal obtained when the immobilized antibody is incubated with samples from patients 
without breast cancer. In general, a sample generating a signal that is three standard 
deviations above the predetermined cut-off value may be considered positive for breast 
10 cancer. In an alternate preferred embodiment, the cut-off value is determined using a 
Receiver Operator Curve, according to the method of Sackett et al.. Clinical 
Epidemiology: A Basic Science for Clinical Medicine, p. 106-7 (Little Brown and Co., 
1985). Briefly, in this embodiment, the cut-off value may be determined from a plot of 
pairs of true positive rates {i.e., sensitivity) and false positive rates (100%-specificity) 
1 5 that correspond to each possible cut-off value for the diagnostic test result. The cut-off 
value on the plot that is the closest to the upper left-hand comer (/.e., the value that 
encloses the largest area) is the most accurate cut-off value, and a sample generating a 
signal that is higher than the cut-off value determined by this method may be 
considered positive. Alternatively, the cut-off value may be shifted to the left along the 
20 plot, to minimize the false positive rate, or to the right, to minimize the false negative 
rate. In general, a sample generating a signal that is higher than the cut-off value 
determined by this method is considered positive for breast cancer. 

In a related embodiment, the assay is performed in a flow-through or 
strip test format, wherein the antibody is immobilized on a membrane, such as 
25 nitrocellulose. In the flow-through test, the polypeptide vwthin the sample bind to the 
immobilized antibody as the sample passes through the membrane. A second, labeled 
antibody then binds to the antibody-polypeptide complex as a solution containing the 
second antibody flows through the membrane. The detection of bound second antibody 
may then be performed as described above. In the strip test format, one end of the 
30 membrane to which antibody is bound is immersed in a solution containing the sample. 
The sample migrates along the membrane through a region containing second antibody 
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and to the area of immobilized antibody. Concentration of second antibody at the area 
of immobilized antibody indicates the presence of breast cancer. Typically, the 
concentration of second antibody at that site generates a pattern, such as a line, that can 
be read visually. The absence of such a pattern indicates a negative result. In general, 

S the amount of antibody inunobilized on the membrane is selected to generate a visually 
discernible pattern when the biological sample contains a level of polypeptide that 
would be sufficient to generate a positive signal in the two-antibody sandwich assay, in 
the format discussed above. Preferably, the amount of antibody inunobilized on the 
membrane ranges from about 25 ng to about 1 ^g, and more preferably from about 50 

10 ng to about 1 ^ig. Such tests can typically be performed with a very small amount of 
biological sample. 

The presence or absence of breast cancer in a patient may also be 
determined by evaluating the level of mRNA encoding a breast tumor-specific 
polypeptide as described herein within the biological sample (e.g., a biopsy, 

15 mastectomy and/or blood sample from a patient) relative to a predetermined cut-off 
value. Such an evaluation may be achieved using any of a variety of methods known to 
those of ordinary skill in the art such as, for example, in situ hybridization and 
amplification by polymerase chain reaction. For example, polymerase chain reaction 
may be used to amplify sequences from cDNA prepared from RNA that is isolated from 

20 one of the above biological samples. Sequence-specific primers for use in such 
amplification may be designed based on the sequences provided in any one of SEQ ID 
N0:1 or SEQ ID NO: 11 - SEQ ID NO:86 and SEQ ID NO:142-SEQ ID NO:226, and 
may be purchased or synthesized. In the case of BlSAgl, as noted herein, one suitable 
primer pair is B18Agl-2 (5'ATG GCT ATT TTC GGG GGC TGA CA) (SEQ ID 

25 NO.:126) and B18Agl-3 (5'CCG GTA TCT CCT CGT GGG TAT T) (SEQ ID 
NO.: 127). The PGR reaction products may then be separated by gel electrophoresis 
and visualized according to methods well known to those of ordinary skill in the art. 
Amplification is typically performed on samples obtained from matched pairs of tissue 
(tumor and non-tumor tissue from the same individual) or from unmatched pairs of 

30 tissue (tumor and non-tumor tissue from different individuals). The amplification 
reaction is preferably performed on several dilutions of cDN A spanning two orders of 
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magnitude, A two-fold or greater increase in expression in several dilutions of the 
tumor sample as compared to the same dilution of the non-tumor sample is considered 
positive. 

Conventional RT-PCR protocols using agarose and ethidium bromide 
5 staining while important in defining gene specificity do not lend themselves to 
diagnostic kit development because of the time and effort required in making them 
quantitative (i.e., construction of saturation and/or titration curves), and their sample 
throughput. This problem is overcome by the development of procedures such as real 
time RT-PCR which allows for assays to be performed in single tubes, and in turn can 
10 be modified for use in 96 well plate formats. Instrumentation to perform such 
methodologies are available from ABI/Perkin Elmer. Alternatively, other high 
throughput assays using labelled probes (e.g., digoxygenin) in combination with 
labelled (e.g., enzyme fluorescent, radioactive) antibodies to such probes can also be 
used in the development of 96 well plate assays. 

15 In yet another method for determining the presence or absence of breast 

cancer in a patient, one or more of the breast tumor-specific polypeptides described 
may be used in a skin test. As used herein, a "skin test" is any assay performed directly 
on a patient in which a delayed-type hypersensitivity (DTH) reaction (such as swelling, 
reddening or dermatitis) is measured following intradermal injection of one or more 

20 polypeptides as described above. Such injection may be achieved using any suitable 
device sufficient to contact the polypeptide or polypeptides with dermal cells of the 
patient, such as a tuberculin syringe or 1 mL syringe. Preferably, the reaction is 
measured at least 48 hours after injection, more preferably 48-72 hours. 

The DTH reaction is a cell-mediated immune response, which is greater 

25 in patients that have been exposed previously to a test antigen (/.e., an immunogenic 
portion of a polypeptide employed, or a variant thereof). The response may measured 
visually, using a ruler. In general, a response that is greater than about 0.5 cm in 
diameter, preferably greater than about 5.0 cm in diameter, is a positive response, 
indicative of breast cancer. 

^0 The breast tumor-specific polypeptides described herein are preferably 

formulated, for use in a skin test, as pharmaceutical compositions containing at least 
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one polypeptide and a physiologically acceptable carrier, such as water, saline, alcohol, 
or a buffer. Such compositions typically contain one or more of the above polypeptides 
in an amount ranging from about 1 ^g to 100 ^g, preferably from about 10 ^g to SO \xg 
in a volume of 0.1 mL. Preferably, the carrier employed in such pharmaceutical 
5 compositions is a saline solution with appropriate preservatives, such as phenol and/or 
Tween 80^^. 

In other aspects of the present invention, the progression and/or response 
to treatment of a breast cancer may be monitored by performing any of the above assays 
over a period of time, and evaluating the change in the level of the response (i.e., the 

10 amount of polypeptide or mRNA detected or, in the case of a skin test, the extent of the 
immime response detected). For example, the assays may be performed every month to 
every other month for a period of ) to 2 years. In general, breast cancer is progressing 
in those patients in whom the level of the response increases over time. In contrast, 
breast cancer is not progressing when the signal detected either remains constant or 

1 5 decreases with time. 

In further aspects of the present invention, the compounds described 
herein may be used for the immunotherapy of breast cancer. In these aspects, the 
compounds (which may be polypeptides, antibodies or nucleic acid molecules) are 
preferably incorporated into pharmaceutical compositions or vaccines. Pharmaceutical 

20 compositions comprise one or more such compounds and a physiologically acceptable 
carrier. Vaccines may comprise one or more polypeptides and an immune response 
enhancer, such as an adjuvant or a liposome (into which the compound is incorporated). 
Pharmaceutical compositions and vaccines may additionally contain a delivery system, 
such as biodegradable microspheres which are disclosed, for example, in U.S. Patent 

25 Nos. 4,897,268 and 5,075,109. Pharmaceutical compositions and vaccines within the 
scope of the present invention may also contain other compounds, including one or 
more separate polypeptides. 

Alternatively, a vaccine may contain DNA encoding one or more of the 
polypeptides as described above, such that the polypeptide is generated in situ. In such 

30 vaccines, the DNA may be present within any of a variety of delivery systems known to 
those of ordinary skill in the art, including nucleic acid expression systems, bacteria and 
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viral expression systems. Appropriate nucleic acid expression systems contain the 
necessary DNA sequences for expression in the patient (such as a suitable promoter and 
terminating signal). Bacterial delivery systems involve the administration of a 
bacterium (such as Bacillus-Calmette-Guerrin) that expresses an immunogenic portion 
5 of the polypeptide on its cell surface. In a preferred embodiment, the DNA may be 
introduced using a viral expression system (e.g., vaccinia or other pox virus, retrovirus, 
or adenovirus), which may involve the use of a non-pathogenic (defective), replication 
competent virus. Techniques for incorporating DNA into such expression systems are 
well known to those of ordinary skill in the art. The DNA may also be "naked," as 
10 described, for example, in Ulmer et al.. Science 259:1745-1749 (1993), and reviewed 
by Cohen, 5dewe 259:1691-1692 (1993). The uptake of naked DNA may be increased 
by coating the DNA onto biodegradable beads, which are efficiently transported into 
the cells. 

While any suitable carrier known to those of ordinary skill in the art may 

15 be employed in the pharmaceutical compositions of this invention, the type of carrier 
will vary depending on the mode of administration. For parenteral administration, such 
as subcutaneous injection, the carrier preferably comprises water, saline, alcohol, a fat, 
a wax or a buffer. For oral administration, any of the above carriers or a solid carrier, 
such as mannitol, lactose, starch, magnesium stearate, sodium saccharine, talcum, 

20 cellulose, glucose, sucrose, and magnesium carbonate, may be employed. 
Biodegradable microspheres (e.g., polylactate polyglycolate) may also be employed as 
carriers for the pharmaceutical compositions of this invention. 

Any of a variety of adjuvants may be employed in the vaccines of this 
invention to nonspecifically enhance the inmiune response. Most adjuvants contain a 

25 substance designed to protect the antigen from rapid catabolism, such as aluminum 
hydroxide or mineral oil, and a stimulator of immune responses, such as lipid A, 
Bortadella pertussis or Mycobacterium tuberculosis derived proteins. Suitable 
adjuvants are commercially available as, for example, Freund's Incomplete Adjuvant 
and Complete Adjuvant (Difco Laboratories, Detroit, MI), Merck Adjuvant 65 (Merck 

30 and Company, Inc., Rahway, NJ), alum, biodegradable microspheres, monophosphoryl 
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lipid A and quil A. Cytokines, such as GM-CSF or interleukin-2, -7, or -12, may also 
be used as adjuvants. 

The above pharmaceutical com]X)sitions and vaccines may be used, for 
example, for the therapy of breast cancer in a patient. As used herein, a "patient" refers 
5 to any warm-blooded animal, preferably a human. A patient may or may not be 
afflicted with breast cancer. Accordingly, the above pharmaceutical compositions and 
vaccines may be used to prevent the development of breast cancer or to treat a patient 
afflicted with breast cancer. To prevent the development of breast cancer, a 
pharmaceutical composition or vaccine comprising one or more polypeptides as 

10 described herein may be administered to a patient. Alternatively, naked DNA or 
plasmid or viral vector encoding the polypeptide may be administered. For treating a 
patient with breast cancer, the pharmaceutical composition or vaccine may comprise 
one or more polypeptides, antibodies or nucleotide sequences complementary to DNA 
encoding a polypeptide as described herein (e.g., antisense RNA or antisense 

1 S deoxyribonucleotide oligonucleotides). 

Routes and frequency of administration, as well as dosage, will vary 
from individual to individual. In general, the pharmaceutical compositions and 
vaccines may be administered by injection (e.g., intracutaneous, intramuscular, 
intravenous or subcutaneous), intranasally (e.g., by aspiration) or orally. Between 1 

20 and 10 doses may be administered for a 52-week period. Preferably, 6 doses are 
administered, at intervals of 1 month, and booster vaccinations may be given 
periodically thereafter. Alternate protocols may be appropriate for individual patients. 
A suitable dose is an amount of a compound that, when administered as described 
above, is capable of promoting an anti-tumor immune response. Such response can be 

25 monitored by measuring the anti-tumor antibodies in a patient or by vaccine-dependent 
generation of cytolytic effector cells capable of killing the patient's tumor cells in vitro. 
Such vaccines should also be capable of causing an immune response that leads to an 
improved clinical outcome (e.g., more frequent remissions, complete or partial or 
longer disease-free survival) in vaccinated patients as compared to non-vaccinated 

30 patients. In general, for pharmaceutical compositions and vaccines comprising one or 
more polypeptides, the amount of each polypeptide present in a dose ranges from about 
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100 ng to 5 mg. Suitable dose sizes will vary with the size of the patient, but will 
typically range from about 0.1 mL to about 5 mL. 

The following Examples are offered by way of illustration and not by 
5 way of limitation. 
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EXAMPLES 
Example 1 

Preparation of Breast Tumor-Specific cDNAs Using Differential Display RT-PCR 

5 

This Example illustrates the preparation of cDNA molecules encoding 
breast tumor-specific polypeptides using a differential display screen. 



A> Preparation of B 1 8Ag 1 cDNA and Characterization of mRNA Expression 

10 Tissue samples were prepared from breast tumor and normal tissue of a 

patient with breast cancer that was confirmed by pathology after removal from the 
patient. Normal RNA and tumor RNA was extracted from the samples and mRNA was 
isolated and converted into cDNA using a (dT),2AG (SEQ ID NO.: 130) anchored 3* 
primer. DifiTerential display PCR was then executed using a randomly chosen primer 

15 (CTTCAACCTC) (SEQ ID NO.: 103). Amplification conditions were standard buffer 
containing 1 .5 mM MgClj, 20 pmol of primer, 500 pmol dNTP, and 1 unit of Taq DNA 
polymerase (Perkin-Elmer, Branchburg, NJ). Forty cycles of amplification were 
performed using 94°C denaturation for 30 seconds, 42**C annealing for 1 minute, and 
ll^C extension for 30 seconds. An RNA fingerprint containing 76 amplified products 

20 was obtained. Although the RNA fingerprint of breast tumor tissue was over 98% 
identical to that of the normal breast tissue, a band was repeatedly observed to be 
specific to the RNA fingerprint pattern of the tumor. This band was cut out of a silver 
stained gel, subcloned into the T-vector (Novagen, Madison, WI) and sequenced. 

The sequence of the cDNA, referred to as BlSAgl, is provided in SEQ 

25 ID N0:1. A database search of GENBANK and EMBL revealed that the BlSAgl 
fi-agment initially cloned is 77% identical to the endogenous human retroviral element 
S71, which is a truncated retroviral element homologous to the Simian Sarcoma Virus 
(SSV). S71 contains an incomplete gag gene, a portion of the pol gene and an LTR- 
like structure at the 3' terminus {see Werner et al.. Virology 774:225-238 (1990)). 

30 BlSAgl is also 64% identical to SSV in the region corresponding to the P30 (gag) 
locus. BlSAgl contains three separate and incomplete reading frames covering a 
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region which shares considerable homology to a wide variety of gag proteins of 
retroviruses which infect mammals. In addition, the homology to S71 is not just within 
the gag gene, but spans several kb of sequence including an LTR. 

BlSAgl -specific PGR primers were synthesized using computer 
5 analysis guidelines. RT-PCR amplification (94T, 30 seconds; 60°C -> 42T, 30 
seconds; 30 seconds for 40 cycles) confirmed that BlSAgl represents an actual 
mRNA sequence present at relatively high levels in the patient's breast tumor tissue. 
The primers used in amplification were B18Agl-l (CTG CCT GAG CCA CAA ATG) 
(SEQ ID NO.:128) and BlSAgM (CCG GAG GAG GAA GCT AGA GGA ATA) 

10 (SEQ ID NO.:129) at a 3.5 mM magnesium concentration and a pH of 8.5, and 
B18Agl-2 (ATG GCT ATT TTC GGG GCC TGA CA) (SEQ ID NO.:126) and 
B18Agl.3 (CCG GTA TCT CCT CGT GGT TATT) (SEQ ID NO.:127) at 2mM 
magnesium at pH 9.5. The same experiments showed exceedingly low to nonexistent 
levels of expression in this patient's normal breast tissue {see Figure 1). RT-PCR 

1 5 experiments were then used to show tiiat Bl 8Agl mRNA is present in nine other breast 
tumor samples (from Brazilian and American patients) but absent in, or at exceedingly 
low levels in, the normal breast tissue corresponding to each cancer patient. RT-PCR 
analysis has also shown that the B18Agl transcript is not present in various normal 
tissues (including lymph node, myocardium and liver) and present at relatively low 

20 levels in PBMC and lung tissue. The presence of B18Agl mRNA in breast tumor 
samples, and its absence from normal breast tissue, has been confirmed by Northern 
blot analysis, as shovm in Figure 2. 

The differentia] expression of B18Agl in breast tumor tissue was also 
confirmed by RNase protection assays. Figure 3 shows the level of B18Agl mRNA in 

25 various tissue types as determined in four different RNase protection assays. Lanes 1- 
12 represent various normal breast tissue samples, lanes 13-25 represent various breast 
tumor samples; lanes 26-27 represent normal prostate samples; lanes 28-29 represent 
prostate tumor samples; lanes 30-32 represent colon tumor samples; lane 33 represents 
normal aorta; lane 34 represents normal small intestine; lane 35 represents normal skin, 

30 lane 36 represents normal lymph node; lane 37 represents normal ovary; lane 38 
represents normal liver; lane 39 represents normal skeletal muscle; lane 40 represents a 
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first normal stomach sample, lane 41 represents a second normal stomach sample; lane 
42 represents a normal lung; lane 43 represents normal kidney; and lane 44 represents 
normal pancreas, Interexperimental comparison was facilitated by including a 
positive control RNA of known B-actin message abundance in each assay and 
5 normalizing the results of the different assays with respect to this positive control. 

RT-PCR and Southern Blot analysis has shown the BlSAgl locus to be 
present in human genomic DNA as a single copy endogenous retroviral element. A 
genomic clone of approximately 12-18 kb was isolated using the initial BlSAgl 
sequence as a probe. Four additional subclones were also isolated by Xbal digestion. 

10 Additional retroviral sequences obtained from the ends of the Xbal digests of these 
clones (located as shown in Figure 4) are shown as SEQ ID N0:3 - SEQ ID NO: 10, 
where SEQ ID N0:3 shows the location of the sequence labeled 10 in Figure 4, SEQ ID 
N0:4 shows the location of the sequence labeled 11-29, SEQ ID N0:5 shows the 
location of the sequence labeled 3, SEQ ID N0:6 shows the location of the sequence 

15 labeled 6, SEQ ID NO:7 shows the location of the sequence labeled 12, SEQ ID N0:8 
shows the location of the sequence labeled 13, SEQ ID N0:9 shows the location of the 
sequence labeled 14 and SEQ ID NO: 10 shows the location of the sequence labeled 1 1- 
22. 

Subsequent studies demonstrated that the 12-18 kb genomic clone 
20 contains a retroviral element of about 7.75 kb, as shown in Figures 5A and 5B. The 
sequence of this retroviral element is shown in SEQ ID NO: 141. The numbered line at 
the top of Figure 5A represents the sense strand sequence of the retroviral genomic 
clone. The box below this line shows the position of selected restriction sites. The 
arrows depict the different overlapping clones used to sequence the retroviral element. 
25 The direction of the arrow shows whether the single-pass subclone sequence 
corresponded to the sense or anti-sense strand. Figure 5B is a schematic diagram of the 
retroviral element containing BlSAgl depicting the organization of viral genes within 
the element. The open boxes correspond to predicted reading frames, starting with a 
methionine, found throughout the element. Each of the six likely reading frames is 
30 shown, as indicated to the left of the boxes, with frames 1-3 corresponding to those 
found on the sense strand. 
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Using the cDNA of SEQ ID N0:1 as a probe, a longer cDNA was 
obtained (SEQ ID NO:227) which contains minor nucleotide differences (less than 1%) 
compared to the genomic sequence shown in SEQ ID N0.141, 

5 ^ Preparation of cDNA Molecules Encoding Other Breast Tumor-Snecific 

Polypeptides 

Normal RNA and tumor RNA was prepared and mRNA was isolated 
and converted into cDNA using a (dT),2AG anchored 3' primer, as described above. 
Differential display PGR was thai executed using the randomly chosen primers SEQ 

10 ID NO.: 87-125. Amplification conditions were as noted above, and bands observed to 
be specific to the RNA fingerprint pattem of the tumor were cut out of a silver stained 
gel, subcloned into either the T-vector (Novagen, Madison, WI) or the pCRII vector 
(Invitrogen, San Diego, CA) and sequenced. The sequences are provided in SEQ ID 
N0:1 1 - SEQ ID NO:86. Of the 79 sequences isolated, 67 were found to be novel 

15 (SEQ ID NO.: 11-77) {see also Figures 6-20). Subsequent studies identified an 
additional 84 sequences (SEQ ID NOS: 142-226), of which 72 appeared to be novel 
(SEQ IDNOS:142, 143, 146-152, 154-166, 168-176, 178-192, 194-198, 200-204, 206, 
207, 209-214, 216, 218, 219, 221-227). To the best of the inventors' knowledge none 
of the previously identified sequences have heretofore been shown to be expressed at a 

20 greater level in human breast tumor tissue than in normal breast tissue. 

Table I shows the level of representative breast tumor-specific 
transcripts present in normal breast tissue (columns BNI -BN7), breast tumor samples 
(columns BTI-BT12) and normal prostate, kidney, liver, lung, skin, small intestine, 
stomach, myocardium, lymph node, pancreas, skeletal muscle, ovary and aorta, as 

25 determined by RT-PCR analysis. A 0-3 grading scale for message abundance is used, 
with 0 denoting no detectable message and 3 a message level comparable to the control 
message (glyceraldehyde 3-phosphate dehydrogenase). The lack of data in a given box 
indicates that the tissue has not been tested for the presence or absence of that specific 
antigen. 
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Example 2 

Preparation of BlSAgl DNA from Human Genomic DNA 

This Example illustrates the preparation of BlSAgl DNA by 
5 amplification from human genomic DNA. 

BlSAgl DNA may be prepared from 250 ng human genomic DNA 
using 20 pmol of BlSAgl specific primers, 500 pmol dNTPS and 1 unit of Taq DNA 
polymerase (Perkin Elmer, Branchburg, NJ) using the following amplification 
parameters: 94°C for 30 seconds denaturing, 30 seconds 60°C to 42**C touchdown 
1 0 annealing in 2°C increments every two cycles and 72®C extension for 30 seconds. The 
last increment (a 42°C annealing temperature) should cycle 25 times. Primers were 
selected using computer analysis. Primers synthesized were BlSAgl-1, BlSAgl -2, 
B18Agl-3, andB18Agl-4. Primer pairs that may be used are 1+3, 1+4, 2+3, and 2+4. 

Following gel electrophoresis, the band corresponding to BlSAgl DNA 
1 5 may be excised and cloned into a suitable vector. 

Example 3 

Preparation of B 1 SAel DNA from Breast Tumor cDNA 

20 This Example illustrates the preparation of BlSAgl DNA by 

amplification from human breast tumor cDNA. 

First strand cDNA is synthesized from RNA prepared from human 
breast tumor tissue in a reaction mixture containing 500 ng poly A+ RNA, 200 pmol of 
the primer (T),2AG (/.e., TTT TTT TTT TTT AG) (SEQ ID NO: 130), IX first strand 

25 reverse transcriptase buffer, 6.7 mM DTT, 500 mmol dNTPs, and 1 unit AMV or 
MMLV reverse transcriptase (from any supplier, such as Gibco-BRL (Grand Island, 
NY)) in a final volume of 30 p.1. After first strand synthesis, the cDNA is diluted 
approximately 25 fold and 1 |al is used for amplification as described in Example 2. 
While some primer pairs can result in a heterogeneous population of transcripts, the 

30 primers BlSAgl-2 (5'ATG GCT ATT TTC GGG GGC TGA CA) (SEQ ID NO: 126) 
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and Bl 8Agl-3 (5'CCG GTA TCT CCT CGT GOG TAT T) (SEQ ID NO: 127) yield a 
single 151 bp amplification product. 

Example 4 

5 Identificatio n of B-cell and T-cell Enitopes of BlSAgl 

This Example illustrates the identification of BlSAgl epitopes. 
The BlSAgl sequence can be screened using a variety of computer 
algorithms. To detemine B-cell epitopes, the sequence can be screened for 
10 hydrophobicity and hydrophilicity values using the method of Hopp, Prog. Clin. Biol. 
Res. J 728:367-77 (1985) or, alternatively, Cease et al., J. Exp. Med. ] 64:1779.^4 
(1986) or Spouge et al., J. Immunol. 755:204-12 (1987). Additional Class II MHC 
(antibody or B-cell) epitopes can be predicted using programs such as AMPHI {e.g., 
Margalit et al.. J. Immunol. 138:22X3 (1987)) or the methods of Rothbard and Taylor 
1 5 {e.g , EMBO J. 7:93 (1 988)). 

Once peptides (15-20 amino acids long) are identified using these 
techniques, individual peptides can be synthesized using automated peptide synthesis 
equipment (available firom manufacturers such as Applied Biosystems, Inc., Foster 
City, CA) and techniques such as Merrifield synthesis. Following synthesis, the 
20 peptides can used to screen sera harvested from either normal or breast cancer patients 
to determine whether patients with breast cancer possess antibodies reactive with the 
peptides. Presence of such antibodies in breast cancer patient would confirm the 
immunogenicity of the specific B-cell epitope in question. The peptides can also be 
tested for their ability to generate a serologic or humoral immune in animals (mice, lats, 
25 rabbits, chimps etc.) following immunization in vivo. Generation of a peptide-specific 
antiserum following such inmiunization further confirms the immunogenicity of the 
specific B-cell epitope in question. 

To identify T-cell epitopes, the BlSAgl sequence can be screened using 
different computer algorithms which are useful in identifying 8-10 amino acid motifs 
30 within the BlSAgl sequence which are capable of binding to HLA Class I MHC 
molecules, (see, e.g., Rammensee et al., Immunogenetics ^7:178-228 (1995)). 
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Following synthesis such peptides can be tested for their ability to bind to class I MHC 
using standard binding assays (e.g,, Sette et al., J. Immunol. 753:5586-92 (1994)) and 
more importantly can be tested for their ability to generate antigen reactive cjrtotoxic T- 
cells following in vitro stimulation of patient or normal peripheral mononuclear cells 

5 using, for example, the methods of Bakker et al.. Cancer Res. 55:5330-34 (1995); 
Visseren et al., J. Immunol 75^:3991-98 (1995); Kawakami et al., J, Immunol, 
75-/:3961-68 (1995); and Kast et al., 1 Immunol 752:39(M-12 (1994). Successful 
in vitro generation of T-cells capable of killing autologous (bearing the same Class I 
MHC molecules) tumor cells following in vitro peptide stimulation further confirms the 

10 inununogenicity of the BlSAgl antigen. Furthermore, such peptides may be used to 
generate murine peptide and BlSAgl reactive cytotoxic T-cells following in vivo 
inununization in mice rendered transgenic for expression of a particular human MHC 
Class I haplotype (Vitiello et aL, J. Exp. Med 775:1007-15 (1991). 

A representative list of predicted B18Agl B-cell and T-cell epitopes, 

15 broken down according to predicted HLA Class I MHC binding antigen, is shown 
below: 

Predicted Th Motifs (B-ceil epitopes) (SEP IDNOS.: 131-133) 
SSGGRTFDDFHRYLLVGI 
20 QGAAQKPINLSKXIEVVQGHDE 
SPGVFLEHLQEAYRIYTPFDLSA 

Predicted HLA A2.1 Motifs (T-cell eoitopesl (SEP ID NOS.: 134-140) 

YLLVGIQGA 
25 GAAQKPINL 

NLSKXIEVV 

EVVQGHDES 

HLQEAYRIY 

NLAFVAQAA 
30 FVAQAAPDS 
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Example 5 

Characterization of Breast Tumor Genes Discovered bv Differential Display PPR 

5 The specificity and sensitivity of the breast tumor genes discovered by 

differential display PCR were determined using RT-PCR. This procedure enabled the 
rapid evaluation of breast tumor gene mRNA expression semiquantitatively without 
using large amounts of RNA. Using gene specific primers, mRNA expression levels in 
a variety of tissues were examined, including 8 breast tumors, 5 normal breasts, 2 
10 prostate tumors, 2 colon tumors, 1 lung tumor, and 14 other normal adult human 
tissues, including normal prostate, colon, kidney, liver, lung, ovary, pancreas, skeletal 
muscle, skin, stomach and testes. 

To ensure the semiquantitative nature of the RT-PCR, P-actin was used 
as internal control for each of the tissues examined. Serial dilutions of the first strand 
1 5 cDNAs were prepared and RT-PCR assays performed using P-actin specific primers. A 
dilution was then selected that enabled the linear range amplification of p-actin 
template, and which was sensitive enough to reflect the difference in the initial copy 
number. Using this condition, the p-actin levels were determined for each reverse 
transcription reaction fi-om each tissue. DNA contamination was minimized by DNase 
20 treatment and by assuring a negative result when using first strand cDNA that was 
prepared vwthout adding reverse transcriptase. 

Using gene specific primers, the mRNA expression levels were 
determined in a variety of tissues. To date 32 genes have been successfiilly examined 
by RT-PCR, three of which exhibit good specificity and sensitivity for breast tumors. 
25 Figures 21A and 21B depict the results for these three genes: B15AG-1 (SEQ ID 
NO:27), B31GAlb (SEQ ID NO:148) and B38GA2a (SEQ ID NO. 157). 

From the foregoing, it will be appreciated that, although specific 
embodiments of the invention have been described herein for the purpose of 
illustration, various modifications may be made without deviating from the spirit and 
30 scope of the invention. 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(1) APPLICANT: Corixa Corporation 

(ii) TITLE OF INVENTION: COMPOSITIONS AND METHODS FOR THE 
TREATMENT AND DIAGNOSIS OF BREAST CANCER 

(ii1) NUMBER OF SEQUENCES: 227 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: SEED and BERRY LLP 

(B) STREET: 6300 Columbia Center. 701 Fifth Avenue 

(C) CITY: Seattle 

(D) STATE: Washington 

(E) COUNTRY: USA 

(F) ZIP: 98104-7092 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentln Release #1.0. Version #1.30 

(v1) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: lO-JAN-1997 

(C) CLASSIFICATION: 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Maki . David J. 

(B) REGISTRATION NUMBER: 31.392 

(C) REFERENCE/DOCKET NUMBER: 21012L419PC 
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(ix) TELECOHMUNICATI(»J INFORMATION: 

(A) TELEPHONE: (206) 622-4900 

(B) TELEFAX: (206) 682-6031 



(2) INFORMATION FOR SEO ID N0:1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 363 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS ; single 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..363 

(x1) SEQUENCE DESCRIPTION: SEQ ID N0:1: 

TTA GAG ACC CAA TTG GGA CCT AAT TGG GAC CCA AAT HC TCA AGT GGA 48 
Leu Glu Thr Gin Leu Gly Pro Asn Trp Asp Pro Asn Phe Ser Ser Gly 
15 10 15 

GGG AGA ACT TTT GAC GAT HC CAC CGG TAT CTC CTC GTG GGT ATT GAG 96 
Gly Arg Thr Phe Asp Asp Phe His Arg Tyr Leu Leu Val Gly He Gin 
20 25 30 

GGA GCT GCC CAG AAA CCT ATA AAC TTG TCT AAG GCG ATT GAA GTC GTC 144 
Gly Ala Ala Gin Lys Pro He Asn Leu Ser Lys Ala He Glu Val Val 
35 40 45 

CAG GGG CAT GAT GAG TCA CCA GGA GTG TTT TTA GAG CAC CTC CAG GAG 192 
Gin Gly His Asp Glu Ser Pro Gly Val Phe Leu Glu His Leu Gin Glu 
50 55 60 



wo 97/25426 



PCT/US97/00485 



36 



GCT TAT CGG ATT TAC ACC CCT TTT GAC CTG GCA GCC CCC GAA AAT A6C 
Ala Tyr Arg lie Tyr Thr Pro Phe Asp Leu Ala Ala Pro Glu Asn Ser 
65 70 75 80 



240 



CAT GCT Cn AAT TTG GCA TTT GTG GCT CAG GCA GCC CCA GAT AGT AAA 
His Ala Leu Asn Leu Ala Phe Val Ala Gin Ala Ala Pro Asp Sen Lys 
85 90 95 



288 



AGG AAA CTC CAA AAA CTA GAG GGA TTT TGC TGG AAT GAA TAC CAG TCA 
Arg Lys Leu Gin Lys Leu Glu Gly Phe Cys Trp Asn Glu Tyr Gin Ser 
100 105 110 



336 



GCT TTT AGA GAT AGC CTA AAA GGT TTT 
Ala Phe Arg Asp Ser Leu Lys Gly Phe 
115 120 



363 



(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 121 amino acids 

(B) TYPE: amino acid 
(0) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:2: 

Leu Glu Thr Gin Leu Gly Pro Asn Trp Asp Pro Asn Phe Ser Ser Gly 
15 10 15 

Gly Arg Thr Phe Asp Asp Phe His Arg Tyr Leu Leu Val Gly He Gin 
20 25 30 



Gly Ala Ala Gin Lys Pro He Asn Leu Ser Lys Ala He Glu Val Val 
35 40 45 
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Gin Gly His Asp Glu Sen Pro Gly Val Phe Leu Glu His Leu Gin Glu 
50 55 60 

Aid Tyr Arg He Tyr Thr Pro Phe Asp Leu Ala Ala Pro Glu Asn Ser 
65 70 75 80 

His Ala Leu Asn Leu Ala Phe Val Ala Gin Ala Ala Pro Asp Ser Lys 
85 90 95 

Arg Lys Leu Gin Lys Leu Glu Gly Phe Cys Trp Asn Glu Tyr Gin Ser 
100 105 110 

Ala Phe Arg Asp Ser Leu Lys Gly Phe 
115 120 

(2) INFORMATION FOR SEQ ID N0:3; 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1101 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:3: 

TCTTA6AATC TTCATACCCC 6AACTCTTGG GAAAACTTTA ATCAGTCACC TACAGTCTAC 60 

CACCCATTTA GGAGGAGCAA AGCTACCTCA GCTCCTCCGG AGCCGTTTTA AGATCCCCCA 120 

TCTTCAAAGC CTAACAGATC AAGCAGCTCT CCGGTGCACA ACCTGCGCCC AGGTAAATGC 180 

CAAAAAAGGT CCTAAACCCA 6CCCAGGCCA CCGTCTCCAA GAAAACTCAC CAGGAGAAAA 240 

GTGGGAAATT GACTTTACAG AAGTAAAACC ACACCGGGCT GGGTACAAAT ACCTTCTAGT 300 
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ACTGGTAGAC ACCTTCTCTG GATGGACTGA AGCATTTGCT ACCAAAAACG AAACTGTCAA 360 

TATGGTAGTT AAGTTTTTAC TCAATGAAAT CATCCCTCGA CGTQGGCTGC CTGTTGCCAT 420 

AGGGTCTGAT AATGGAACG6 CCTTCGCCTT 6TCTATAGTT TAATCAGTCA 6TAAGGCGTT 480 

AAACATTCAA TGGAAGCTCC ATT6T6CCTA TCGACCCAGA GCTCT6GGCA AGTAGAACGC 540 

ATGAACTGCA CCCTAAAAAA ACACTCTTAC AAAATTAATC TTAAAAACCG GTGTTAATTG 600 

TGTTAGTCTC CTTCCCTTAG CCCTACHAG A6TTAAGGTG CACCCCHAC TGGGCTGG6T 660 

TCTHACCn TTGAAATCAT NTHNGGAAG GGGCTGCCTA TCTTTNCnA ACTAAAAAAN 720 

GCCCATTT6G CAAAAATTTC NCAACTAATT TNTACGTNCC TACGTCTCCC CAACAGGTAN 780 

AAAAATCTNC TGCCCTTTTC AAGGAACCAT CCCATCCATT CCTNAACAAA AGGCCTGCCN 840 

TTCTTCCCCC AGTTAACTNT TTTHNTTAA AATTCCCAAA AAANGAACCN CCTGCTGGAA 900 

AAACNCCCCC CTCCAANCCC CGGCCNAAGN GGAAGGTTCC CHGAATCCC NCCCCCNCNA 960 

ANGGCCCG6A ACCNTTAAAN TNGnCCNGG GGGTNNGGCC TAAAAGNCCN ATTTGGTAAA 1020 

CCTANAAATT TTHCTTTTN TAAAAACCAC NNTTTNNTTT TTCTTAAACA AAACCCTNTT 1080 

TNTAGNANCN TAnTCCCNC C HOI 
(2) INFORMATION FOR SEQ ID N0;4; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1087 base pairs 

(B) TYPE: nucleic acid 

(C) STRANOEDNESS: single 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID N0:4: 

TCTAGAGCTG CGCCTGGATC CCGCCACAGT GAGGAGACCT GAAGACCAGA GAAAACACAG 60 

CAAGTAGGCC CTTTAAACTA CTCACCTGTG TTGTCTTCTA ATTTAHCTG HTTATTTTG 120 

TTTCCATCAT TTTAAGGGGT TAAAATCATC TTGTTCAGAC CTCAGCATAT AAAATGACCC 180 

ATCTGTAGAC CTCAGGCTCC AACCATACCC CAAGAGTTGT CTGCHTTGI TTAAATTACl 240 

GCCAGGTTTC A6CTGCAGAT ATCCCTGGAA GGAATAHCC AGATTCCCTG AGTAGTTTCC 300 

AGGTTAAAAT CCTATAGGCT TCTTCTGHT TGAGGAAGAG TTCCTGTCAG AGAAAAACAT 360 

GATinCGAT TTTTAACTTT AATGCTTGTG AAAC6CTATA AAAAAAATH TCTACCCCTA 420 

6CTTTAAAGT ACTGTTAGTG AGAAATTAAA ATTCCTTCAG GAGGATTAAA CTGCCATTTC 480 

AGTTACCCTA ATTCCAAATG TTTTGGTGGT TAGAATCTTC TTTAATGTTC TTGAAGAAGT 540 

GTTTTATATT TTCCCATCNA GATAAATTCT CTCNCNCCTT NNTTTTNTNT CTNNTTTTTT 600 

AAAACGGANT CHGCTCCGT TGTCCANGCT GGGAAHTTN TTTTGGCCAA TCTCCGCTNC 660 

CTTGCAANAA TNCTGCNTCC CAAAATTACC NCCTTTTTCC CACCTCCACC CCNNGGAATT 720 

ACCTGGAATT ANAGGCCCCC NCCCCCCCCC CGGCTAATTT GTHnGITT HAGTAAAAA 780 

ACGGGTTTCC TGTTTTAGTT A6GATGGCCC ANNTCTGACC CCNTNATCNT CCCCCTCNGC 840 

CCTCNAATNT TNGGNMTANG GCTTACCCCC CCCNGNNGTT TTTCCTCCAT TNAAATHTC 900 

TNTGGANTCT TGAATNNCGG GTTTTCCCTT TTAAACCNAT TTTTTTTTTN NNNCCCCCAN 960 
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HTTNCCTCC CCCNTNTNTA ANGGGGGTTT CCCAANCCGG GTCCNCCCCC ANGTCCCCM 1020 

TTTTTCTCCC CCCCCCTCTT HTTCTTTNC CCCAAAANTC CTATCTTTTC CTNNAAATAT 1080 

CNANTNT 1087 
(2) INFORMATION FOR SEQ ID NO: 5; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1010 base pairs 

(B) TYPE: nucleic acid 

(C) STRANOEDNESS: single 

(D) TOPOLOGY; linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:5: 

TCTAGACCAA GAAATGGGAG GATTTTAGAG TGACTGATGA TTTCTCTATC ATCTGCAGTT 60 

AGTAAACATT CTCCACAGTT TATGCAAAAA GTAACAAAAC CACTGCAGAT GACAAACACT 120 

AGGTAACACA CATACTATCT CCCAAATACC TACCCACAAG CTCAACAATT TTAAACTGTT 180 

AGGATCACTG GCTCTAATCA CCATGACATG AGGTCACCAC CAAACCATCA AGCGCTAAAC 240 

AGACAGAATG TTTCCACTCC TGATCCACTG TGTGGGAAGA AGCACCGAAC TTACCCACTG 300 

GGGG6CCTGC NTCANAANAA AA6CCCATGC CCCCGGGTNT NCCTTTNAAC CGGAAC6AAT 360 

NAACCCACCA TCCCCACANC TCCTCTGTTC NTGGGCCCTG CATCTTGTGG CCTCNTNTNC 420 

TTTNGG6GAN ACNT6GGGAA GGTACCCCAT TTCNTTGACC CCNCNANAAA ACCCCNGTGG 480 

CCCTTTGCCC TGATTCNCNT GGGCCTTTTC TCHTTCCCT TTTGGGTTGT TTAAATTCCC 540 

AATGTCCCCN GAACCCTCTC CNTNCTGCCC AAAACCTACC TAAATTNCTC NCTANGNNH 600 
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TTCTTGGTGT TNCTTTTCAA AGGTNACCTT NCCTGTTCAN NCCCNACNAA AATTTNTTCC 660 

NTATNNTG6N CCCNNAAAAA NNNATCNNCC CNAATTGCCC GAATTGGTTN GGTTTTTCCT 720 

NCTGGGGGAA ACCCTTTAAA TTTCCCCCTT 6GCCGGCCCC CCTTHTTCC CCCCmNGA 780 

A6GCAGGNGG TTCTTCCCGA ACTTCCAATT NCAACAGCCN TGCCCATTGN TGAAACCCTT 840 

TTCCTAAAAT TAAAAAATAN CCGGTTNNGG NN6GCCTCTT TCCCCTCCNG GNGGGNNGNG 900 

AAANTCCTTA CCCCNAAAAA G6TT6CTTAG CCCCCNGTCC CCACTCCCCC NGGAAAAATN 960 

AACCHTTCN AAAAAAGGAA TATAANTTTN CCACTCCTTN GTTCTCTTCC 1010 
(2) INFORMATION FOR SEQ ID N0:6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 950 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(x1) SEQUENCE DESCRIPTION: SEO ID N0:6: 

TCTAGA6CTC GCGGCCGCGA 6CTCTAATAC GACTCACTAT A6GGCGTCGA CTCGATCTCA 60 

GCTCACTGCA ATCTCT6CCC CCGGGGTCAT GCGATTCTCC TGCCTCA6CC TTCCAAGTAG 120 

CTGGGATTAC AGGC6TGCAA CACCACACCC GGCTAATHT GTATTTTTAA TAGA6ATGGG 180 

GTTTTCCCTT GTTGGCCANN ATGGTCTCNA ACCCCT6ACC TCNNGTGATC CCCCCNCCCN 240 

NGANCTCNNA CTGCTGG66A TNNCCGNNNN NNNCCTCCCN NCNCNNNNNN NCNCNNTCCN 300 
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TNNTCCTTNC TCNNNNNNNN CNNTCNNTCC NNCTTCTCNC CNNNTNTTNT CNNCNNCCNN 360 

CNNNCCNCNT NCCCNCNNNT TCNCNTNCNN TNTCCNNCNN NNTCNMCNNN CNNNfCNTNN 420 

CCNNTACNTC NTNNNCNNNT CCNTCTNTNN CCTCNNCNNT CNCTNCNCNT TNTCTCCTCN 480 

NTNNNNNNCT CCNNNNNTCT CNTCNCNNCN TNCCTCNNTN NCCNCNCCCC NCCTCNCNNC 540 

CTNNTTTNNN CNNCNMNTCC NTNCCNTTCN NNTCCNNTNN CNNCNTCNCN NNCNnNTTC 600 

CCNCCNNTTC CTTNCNCNTN NNNTNTCNNN CNCNTCNNTC NTTTNCTCCT NNNTCCCNNC 660 

TCNNTTCNCC CNNNTCCNCC CCCCNCCTNT CTCTCNCCCN NNTNNNTNTN NNNCNTCCNC 720 

TNTCNCNTIC NTCNNTNCNT TNCTNTCNNC NNCNNTNCNC TNCCNTNTNT CTNNNTCNCN 780 

TCNCNTNTCN CCNTCCNTTN CTNTCTCCTN TNTCCTTCCC CTCNCCTNCT CNTTCNCCNC 840 

CCNNTNTNTN TNNCNCCNNT NCTNNNCNNC CNTCNTTTCN TCTCTNCTNN NNNTNNCCTC 900 

NNCCCNTNCC CTNNTNCNCT NCTNNTACCN TNCTMCTCCN TCTTCCTTCC 950 
(2) INFORMATION FOR SEQ ID NO: 7: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1086 base pairs 

(B) TYPE: nucleic acid 

(C) STRANOEDNESS: single 
(0) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:7: 
TCTAGAGCTC 6C6GCCGCGA GCTCAATTAA CCCTCACTAA AGG6AGTCGA CTCGATCAGA 
CTGTTACTGT GTCTATGTAG AAAGAAGTAG ACATAAGAGA TTCCATTTTG TTCTGTACTA 



60 



120 
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AGAAAAATTC TTCTGCCTTG AGATGCTGTT AATCTGTAAC CCTAGCCCCA ACCCTGT6CT 180 

CACAGAGACA TGTGCTGTGT TGACTCAAGG TTCAATGGAT TTAGGGCTAT GCTTTGTTAA 240 

AAAAGTGCTT GAAGATAATA TGCTTGTTAA AAGTCATCAC CATTCTCTAA TCTCAAGTAC 300 

CCAGGGACAC AATACACT6C 6GAAG6CCGC AGGGACCTCT GTCTAG6AAA GCCAGGTATT 360 

6TCCAAGATT TCTCCCCATG TGATAGCCTG AGATATGGCC TCATGGGAAG G6TAAGACCT 420 

GACTGTCCCC CA6CCCGACA TCCCCCAGCC CGACATCCCC CAGCCCGACA CCCGAAAAGG 480 

GTCTGTGCTG AGGAAGATTA NTAAAAGAGG AAGGCTCTTT GCATTGAAGT AAGAAGAAGG 540 

CTCTGTCTCC TGCTCGTCCC TGGGCAATAA AATGTCTTGG TGTTAAACCC GAATGTATGT 600 

TCTACTTACT 6AGAATAGGA GAAAACATCC TTAGGGCTGG AGGTGAGACA CCCTGGCGGC 660 

ATACT6CTCT HAATGCACG AGATGTHGT NTAATTGCCA TCCAGGGCCA NCCCCTTTCC 720 

TTAACTTTTT ATGANACAAA AACTTTGTTC NCTTTTCCTG CGAACCTCTC CCCCTATTAN 780 

CCTATTGGCC TGCCCATCCC CTCCCCAAAN GGTGAAAANA TGTTCNTAAA TNCGAGGGAA 840 

TCCAAAACNT TTTCCCGTTG GTCCCCTTTC CAACCCCGTC CCTGGGCCNN TTTCCTCCCC 900 

AACNTGTCCC GGNTCCHCN TTCCCNCCCC CTTCCCNGAN AAAAAACCCC GTNTGANGGN 960 

GCCCCCTCAA ATTATAACCT TTCCNAAACA AANNGGTTCN AAGGTGGTTT GNTTCCGGTG 1020 

CGGCTGGCCT T6A6GTCCCC CCTNCACCCC AATTTGGAAN CCNGTTTTTT TTATT6CCCN 1080 

NTCCCC 1086 
(2) INFORMATION FOR SEQ ID N0:8: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1177 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:8: 
NCCNTTTAGA TGTT6ACAAN NTAAACAAGC NGCTCAGGCA GCTGAAAAAA GCCACTGATA 60 

AAGCATCCTG GAGTATCAGA GTTTACTGTT AGATCAGCCT CATTTGACH CCCCTCCCAC 120 

ATGGTGTTTA AATCCAGCTA CACTACTTCC TGACTCAAAC TCCACTATTC CTGTTCATGA 180 

CTGTCAGGAA CTGTTGGAAA CTACTGAAAC TGGCCGACCT GATCTTCAAA ATGTGCCCCT 240 

AGGAAAGGTG GATGCCACCG TGTTCACAGA CAGTACCNCC TTCCTCGAGA AGGGACTACG 300 

AGQGGCCGGT GCANCTGTTA CCAAGGAGAC TNATGTGTTG TGGGCTCAGG CHTACCANC 360 

AAACACCTCA NCNCNNAAGG CTGAATTGAT CGCCCTCACT CAGGCTCTCG GATGGGGTAA 420 

GQGATATTAA CGTTAACACT GACAGCAGGT ACGCCTTTGC TACTGTGCAT GTACGTGGAG 480 

CCATCTACCA GGAGCGTGGG CTACTCACTC GGCAGGTGGC TGTNATCCAC TGTAAANGGA 540 

CATCAAAAGG AAAACNNGGC TGTTGCCCGT GGTAACCANA AANCTGATCN NCAGCTCNAA 600 

6ATGCTGTGT TGACTTTCAC TCNCNCCTCT TAAACTTGCT GCCCACANTC TCCTTTCCCA 660 

ACCAGATCTG CCTGACAATC CCCATACTCA AAAAAAAAAN AANACTGGCC CCGAACCCNA 720 
ACCAATAAAA ACGGGGANGG TNG6TNGANC NNCCTGACCC AAAAATAATG GATCCCCCGG 780 
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GCTGCAGGAA HCAATTCAN CCTTATCNAT ACCCCCAACN N6GNGGGGGG GGCCNGTNCC 840 

CATTNCCCCT NTATTNATTC TTTNNCCCCC CCCCCGGCNT CCTTTTTNAA CTCGTGAAAG 900 

GGAAAACCTG NCTTACCAAM TTATCNCCTG GACCNTCCCC TTCCNCGGTN GNTTANAAAA 960 

AAAAGCCCNC ANTCCCNTCC NAAATHGCA CNGAAAGGNA AGGAATTTAA CCmATTTT 1020 

TTNNTCCTTT ANTTTGTNNN CCCCCTTTTA CCCAGGCGAA CNGCCATCNT TTAANAAAAA 1080 

AAANAGAANG TTTAHTnC CTTNGAACCA TCCCAATANA AANCACCCGC NGGGGAACGG 1140 

GGNGGNAGGC CNCTCACCCC CHTNTGING GNGGGNC 1177 
(2) INFORMATION FOR SEQ ID N0:9; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1146 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:9: 

NCCNNTTNNT GATGTTGTCT THTGGCCTC TCTTTGGATA CTTTCCCTCT CHCAGAGGT 60 

GAAAAGGGTC AAAAGGAGCT GTTGACAGTC ATCCCAGGTG GGCCAATGTG TCCAGA6TAC 120 

AGACTCCATC AGTGAGGTCA AAGCCTGGGG CTTTTCAGAG AAGGGAGGAT TATGGGTTTT ISO 

CCAATTATAC AAGTCAGAAG TAGAAAGAAG GGACATAAAC CAGGAAGGQG GTGGAGCACT 240 

CATCACCCAG AGGGACTTGT GCCTCTCTCA GTGGTAGTAG AGGGGCTACT TCCTCCCACC 300 

ACGGnGCAA CCAAGAGGCA ATGGGTGATG AGCCTACAGG GGACATANCC GAGGAGACAT 360 
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GGGATGACCC I AAGGGAGTA GGCIGGHTT AAGGCGGTGG GACTGGGTGA GGGAAACTCT 420 

CCTCTTCTTC A6AGAGAA6C AGTACAGGGC GAGCTGAACC GGCTGAA6GT CGAGGCGAAA 480 

ACACGGTCTG GCTCA6GAAG ACCTTGGAAG TAAAAHATG AATGGTGCAT GAATGGAGCC 540 

ATGGAAGGGG TGCTCCTGAC CAAACTCAGC CATTGATCAA TGTTAGGGAA ACTGATCAGG 600 

GAAGCC6GGA ATTTCATTAA CAACCCGCCA CACAGCTTGA ACATTGTGAG GTTCAGTGAC 660 

CCTTCAAGGG GCCACTCCAC TCCAACHTG GCCATTCTAC TTTGCNAAAT TTCCAAAAC1 720 

TCCTTTHTA AGGCCGAATC CNTANTCCCT NAAAAACNAA AAAAAATCTG CNCCTATTCT 780 

66AAAAGGCC CANCCCTTAC CAGGCTGGAA GAAATTTTNC CTTTTTTTTT TTTTTGAAGG 840 

CNTTTNTTAA ATTGAACCTN AATTCNCCCC CCCAAAAAAA AACCCNCCNG GGGGGCGGAT 900 

TTCCAAAAAC NAATTCCCTT ACCAAAAAAC AAAAACCCNC CCTTNTTCCC HCCNCCCTN 960 

TTCTTTTAAT TAGGGAGAGA TNAAGCCCCC CAATTTCCNG GNCTNGATNN GTTTCCCCCC 1020 

CCCCCATTTT CCNAAACTTT HCCCANCNA GGAANCCNCC CnTTTTTNG GTCNGATTNA 1080 

NCAACCTTCC AAACCAHTT TCCNNAAAAA NTTTGNTNGG NGGGAAAAAN ACCTNNTTTT 1140 

ATAGAN 1146 
(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 545 base pairs 

(B) TYPE: nucleic acid 

(C) STRANOEONESS: single 

(D) TOPOLOGY; linear 
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(xi) SEQUENCE DESCRIPTION: SEQ 10 N0:10: 

CTTCATTGGG TACGGGCCCC CTCGAGGTCG ACGGTATCGA TAAGCTTGAT ATCGAATTCC 60 

TGCAGCCCGG GGGATCCACT AGTTCTAGAG TCAGGAAGAA CCACCAACCT TCCTGATTTT 120 

TATTGGCTCT GAGTTCTGAG GCCAGTnTC TTCTTCTGTT GAGTATGCGG GATTGTCAGG 180 

CAGATCTGGC TGTGGAAAGG AGACTGTGGG CAGCAAGTTT AGAGGCGTGA CTGAAAGTCA 240 

CACTGCATCT TGAGCTGCTG AATCAGCTTT CT6GTTACCA CGGGCAACAG CCGIGHTTC 300 

CTTTTGATGT CCTTTACAGT GGATTACAGC CACCTGCTGA GGTGAGTAGC CCACGCTCCT 360 

GGTAGATGGC TCCACGTACA TGCACAGTAG CAAAGGCGTA CCTGCTGTCA GTGTTAACGT 420 

TAATATCCTT ACCCCATCGG AGAGCCTGAG TGAGGGCGAT CAATTCAGCC CTTTTGTGCT 480 

GAGGTGTTTG CTGGTTAAGC CCTGAACCCA CAACACATCT GTCTCCATGG TAACAGCTGC 540 

ACCGG 545 
(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 196 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEONESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NOrll: 
TCTCCTAGGC TGGGCACAGT GGCTCATACC TGTAATCCTG ACCGTTTCAG AGGCTCAGGT 60 
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GGGGGGATCG CTTGAGCCCA AGATTTCAAG ACTAGTCTG6 GTAACATAGT 6AGACCCTAT 120 

CTCTACGAAA AAATAAAAAA ATGAGCCTGG TGTAGTGGCA CACACCAGCT GAGGAGGGAG 180 

AATCGAGCCT AGGAGA 196 
(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 388 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

TCTCCTAGGC TTGGGGGCTC TGACTAGAAA TTCAAGGAAC CTGGGATTCA AGTCCAACTG 60 

TGACACCAAC TTACACTGTG GNCTCCAATA AACTGCTTCT TTCCTATTCC CTCTCTATTA 120 

AATAAAATAA GGAAAACGAT GTCTGTGTAT AGCCAAGTCA GNTATCCTAA AAGGAGATAC 180 

TAAGTGACAT TAAATATCAG AATGTAAAAC CTGGGAACCA GGTTCCCAGC CTGGGATTAA 240 

ACTGACAGCA AGAAGACTGA ACAGTACTAC TGTGAAAAGC CCGAAGNGGC AATATGTTCA 300 

CTCTACCGTT GAAGGATGGC TGGGA6AATG AAT6CTCTGT CCCCCAGTCC CAAGCTCACT 360 

TACTATACCT CCTTTATAGC CTAGGAGA 388 
(2) INFORMATION FOR SEQ ID NO: 13: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 337 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEO ID N0:13: 

TAGTAGTTGC CTATAATCAT GTTTCTCATT ATTTTCACAT TTTATTAACC AATTTCTGTT 60 

TACCCTGAAA AATATGAGGG AAATATATGA AACAGGGAGG CAATGTTCAG ATAATTGATC 120 

ACAAGATATG ATTTCTACAT CAGATGCTCT TTCCTTTCCT GTTTATTTCC TTTTTATTTC 180 

GGTTGTGGGG TCGAATGTAA TAGCTTTGTT TCAAGAGAGA GTTTTG6CAG TTTCTGTAGC 240 

TTCTGACACT GCTCATGTCT CCAGGCATCT ATHGCACTT TAGGAGGTGT CGTGGGAGAC 300 

TGAGAGGTCT ATTTTTTCCA TATTTGGGCA ACTACTA 337 
(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 571 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:14: 

TAGTAGTTGC CATACAGTGC CTTTCCATTT ATTTAACCCC CACCTGAACG GCATAAACTG 60 

AGTGTTCAGC TGGTGTTrTT TACTGTAAAC AATAAGGAGA CTTTGCTCTT CATTTAAACC 120 

AAAATCATAT TTCATATTTT ACGCTCGAGG GTTTTTACCG GTTCCTTTTT ACACTCCTTA 180 
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AAACAGTTTT TAAGTCGTTT GGAACAAGAT ATTTTTTCTT TCCTGGCAGC TTTTAACATT 240 

ATAGCAAATT TGTGTCTGGG GGACTGCTGG TCACTGTTTC TCACAGTTGC AAATCAAG6C 300 

ATTTGCAACC AAGAAAAAAA AATTTTTTTG TTTTATTTGA AACTGGACCG GATAAACGGT 360 

6TTTGGAGCG GCTGCTGTAT ATAGTTTTAA ATGGTTTATT GCACCTCCTT AAGTTGCACT 420 

TATGTGGGGG GGGGNTTHG NATAGAAAGT NTTTANTCAC ANAGTCACAG GGACTTTTNT 480 

CTTTTGGNNA CTGAGCTAAA AAGGGCTGNT TTTCGGGTGG GGGCAGATGA AGGCTCACAG 540 

GAGGCCTTTC TCTTAGAGGG GGGAACTNCT A 571 
(2) INFORMATION FOR SEQ ID N0:15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 548 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(0) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:15: 

TATATATTTA ATAACTTAAA TATATTTTGA TCACCCACTG GGGTGATAAG ACAATAGATA 60 

TAAAAGTATT TCCAAAAAGC ATAAAACCAA AGTATCATAC CAAACCAAAT TCATACTGCT 120 

TCCCCCACCC GCACTGAAAC TTCACCTTCT AACTGTCTAC CTAACCAAAT TCTACCCTTC 180 

AAGTCTTTGG TGCGTGCTCA CTACTCTTTT TTTTTTTTTT TTTNTTTTGG AGATGGAGTC 240 

TGGCTGTGCA GCCCAGGGGT GGAGTACAAT GGCACAACCT CAGCTCACTG NAACCTCCGC 300 

CTCCCAGGTT CATGAGATTC TCCTGNTTCA GCCTTCCCAG TAGCTGGGAC TACAGGTGTG 360 
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CATCACCATG CCTGGNTAAT CTTTTTTNGT TTTNGGGTAG AGATGGGGGT TTTACATGTT 420 

GGCCAGGNTG GTNTCGAACT CCTGACCTCA AGTGATCCAC CCACCTCAGG CTCCCAAAGT 480 

GCTAGGATTA CAGACATGA6 CCACTGNGCC CAGNCCTGGT GCAT6CTCAC TTCTCTAGGC 540 



AACTACTA 

(2) INFORMATION FOR SEQ ID NO: 16: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 638 base pairs 

(B) TYPE: nucleic add 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



548 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

TTCC6TTAT6 CACATGCAGA ATATTCTATC GGTACTTCAG CTATTACTCA TTTTGATGGC 60 

GCAATCCGAG CCTATCCTCA AGATGAGTAT TTAGAAAGAA TTGATTTAGC GATAGACCAA 120 

GCTGGTAAGC ACTCTGACTA CACGAAATTG TTCAGATGTG ATGGATTTAT 6ACAGTTGAT 180 

CTTTGGAAGA GATTATTAAG TGATTATTTT AAAGGGAATC CATTAATTCC AGAATATCTT 240 

GGTTTAGCTC AAGATGATAT AGAAATAGAA CAGAAAGAGA CTACAAATGA AGATGTATCA 300 

CCAACTGATA TTGAAGAGCC TATAGTAGAA AATGAATTAG CTGCAHTAT TAGCCTTACA 360 

CATAGCGATT TTCCTGATGA ATCTTATATT CAGCCATCGA CATAGCATTA CCTGATGGGC 420 

AACCTTACGA ATAATAGAAA CTGGGTGCGG GGCTAHGAT GAATTCATCC NCAGTAAATT 480 
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TGGATATNAC AAAATATAAC TCGATTGCAT TTGGATGATG GAATACTAAA TCTGGCAAAA 540 
GTAACTTTGG AGCTACTAGT AACCTCTCTT TTTGAGATGC AAAATTTTCT TTTAGGGTTT 600 
CTTATTCTCT ACTTTACG6A TATTGGAGCA TAACGGGA 638 
(2) INFORMATION FOR SEQ 10 NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 286 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ 10 NO: 17: 

ACTGATGGAT GTCGCCGGAG GCGAGGGGCC TTATCTGATG CTC6GCTGCC TGTTCGTGAT 60 

GTGCGCGGCG ATTGGGCTGT TTATCTCAAA CACCGCCACG GCGGTGCTGA TGGCGCCTAT 120 

TGCCTTAGCG GCGGCGAAGT CAATGGGCGT CTCACCCTAT CCTTTTGCCA TGGTGGTGGC 180 

GATGGCGGCT TCGGCGGCGT TTATGACCCC GGTCTCCTCG CCGGTTAACA CCCTGGIGCI 240 

TGGCCCTGGC AAGTACTCAT TTAGCGATTT TGTCAAAATA GGCGTG 286 
(2) INFORMATION FOR SEQ ID N0:18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 262 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

TCGGTCATAG CAGCCCCTTC TTCTCAATTT CATCTGTCAC TACCCT66TG TAGTATCTCA 60 

TAGCCTTACA TTTTTATAGC CTCCTCCCTG GTCTGTCTTT TGATTTTCCT GCCTGTAATC 120 

CATATCACAC ATAACTGCAA GTAAACATTT CTAAAGTGTG GTTATGCTCA TGTCACTCCT 180 

GTGNCAAGAA ATAGTTTCCA TTACCGTCTT AATAAAATTC GGATTTGTTC TTTNCTATTN 240 

TCACTCTTCA CCTATGACCG AA 262 
(2) INFORMATION FOR SEO ID NO: 19: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 261 base pairs 

(B) TYPE: nucleic acid 

(C) STRANOEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:19: 
TCGGTCATAG CAAAGCCAGT GGTTTGAGCT CTCTACTGTG TAAACTCCTA AACCAAGGCC 60 
ATTTATGATA AATGGTGGCA GGATTTTTAT TATAAACATG TACCCATGCA AATTTCCTAT 120 
AACTCTGAGA TATAHCTTC TACATTTAAA CAATAAAAAT AATCTATHT TAAAAGCCTA 180 
ATTTGCGTAG TTAGGTAAGA GTGTTTAATG AGAGGGTATA AGGTATAAAT CACCAGTCAA 240 
CGTTTCTCTG CCTATGACCG A 261 
(2) INFORMATION FOR SEQ ID N0:20: 

(1) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 294 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:20: 

TACAACGAGG CGACGTCGGT AAAATCGGAC AT6AAGCCAC CGCTGGTCTT TTCGTCCGAG 60 

CGATAGGCGC CGGCCAGCCA GCGGAACGGT TGCCCGGATG GCGAAGCGAG CCG6AGTTCT 120 

TCGGACTGAG TATGAATCTT GTTGTGAAAA TACTCGCCGC CTTCGTTCGA CGACGTCGCG 180 

TCGAAATCTT CGANCTCCTT ACGATCGAAG TCTTCGTGGG CGAC6ATCGC GGTCAGTTCC 240 

GCCCCACCGA AATCATGGTT GAGCCGGATG CTGNCCCCGA AGNCCTCGTT TGTN 294 
(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 208 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(x1) SEQUENCE DESCRIPTION; SEQ ID N0:21: 

TTGGTAAA6G GCATGGACGC AGACGCCTGA CGTTTGGCTG AAAATCTTTC ATTGATTCGT 60 

ATCAATGAAT AGGAAAATTC CCAAAGAGGG AATGTCCTGT TGCTCGCCAG TTTTTNTGTT 120 

6TTCTCATGG ANAAGGCAAN 6AGCTCTTCA GACTATTGGN ATTNTCGTTC GGTCTTCTGC 180 



CAACTAGTCG NCTTGCNANG ATCTTCAT 



208 
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(2) INFORMATION FOR SEO ID N0:22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 287 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEONESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:22: 

NCCNTTGAGC TGAGTGATTG AGATNTGTAA TGGTTGTAAG GGTGATTCAG GCGGATTAGG 60 

GTGGCGGGTC ACCCGGCAGT GGGTCTCCCG ACAGGCCAGC AGGATTTGGG GCAGGTACGG 120 

NGTGCGCATC GCTCGACTAT AT6CTATGGC AGGCGAGCCG TGGAAGGNGG ATCAGGTCAC 180 

GGCGCTGGAG CTTTCCACGG TCCATGNATT 6NGATGGCTG TTCTAGGCGG CTGTTGCCAA 240 

GCGTGATGGT ACGCTGGCTG GAGCATTGAT TTCTGGTGCC AAGGTGG 287 
(2) INFORMATION FOR SEQ ID N0:23: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 204 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEONESS: single 

(D) TOPOLOGY: linear 



(x1) SEQUENCE DESCRIPTION: SEQ ID N0:23: 
TTGGGTAAAG GGAGCAAGGA GAAGGCATGG AGAGGCTCAN GCTGGTCCTG GCCTACGACT 60 
GGGCCAAGCT GTCGCCGGGG ATGGTGGAGA ACTGAAGCGG GACCTCCTCG AGGTCCTCCG 120 
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NCGTTACTTC NCCGTCCAGG AGGAGGGTCT TTCCGTGGTC TNGGAGGAGC GGGGGGAGAA 180 
GATNCTCCTC ATGGTCNACA TCCC 204 
(2) INFORMATION FOR SEQ ID N0:24; 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 264 base pairs 

(B) TYPE: nucleic acid 

(C) STRANOEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:24: 

TGGATTGGTC AGGAGCGGGT A6AGTGGCAC CATTGAGGGG ATATTCAAAA ATATTATTTT 60 

GTCCTAAATG ATAGTTGCTG AGTTTTTCTT TGACCCATGA GTTATATTGG AGTTTATHT 120 

TTAACTTTCC AATCGCATGG ACATGHAGA CTTATTTTCT GTTAATGATT NCTATTTTTA 180 

TTAAATTGGA TTTGAGAAAT TGGTTNTTAT TATATCAATT TTTGGTATTT GTTGAGTTTG 240 



ACATTATAGC TTAGTATGTG ACCA 

(2) INFORMATION FOR SEQ ID N0:25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 376 base pairs 

(B) TYPE: nucleic acid 

(C) STRANOEDNESS: single 

(D) TOPOLOGY: linear 



264 



Cxi) SEQUENCE DESCRIPTION: SEQ ID N0:25: 
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TTACAACGAG GGGAAACTCC GTCTCTACAA AAATTAAAAA ATTAGCCAGG TGTGGTGGTG 60 

TGCACCCGCA ATCCCAGCTA CHGGGAGGT TGAGACACAA GANTCACCTA NATGTGGGAG 120 

GTCAAGGHG CATGAGTCAT GATTGT6CCA CTGCACTCCA GCCTG6GTGA CAGACCGAGA 180 

CCCTGCCTCA ANAGANAANG AATAGGAAGT TCA6AAATCN TGGNTGTGGN GCCCAGCAAT 240 

CTGCATCTAT NCAACCCCTG CAGGCAANGC TGATGCAGCC TANGTTCAAG AGCTGCTGTT 300 

TCTGGAGGCA GCAGTTNGGG CTTCCATCCA GTATCACGGC CACACTCGCA CNAGCCATCT 360 

GTCCTCCGTN TGTNAC 375 
(2) INFORMATION FOR SEQ ID N0:26: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 372 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:26: 

TTACAACGAG GGGAAACTCC GTCTCTACAA AAATTAAAAA ATTAGCCAGG TGTGGTGGTG 60 

TGCACCTGTA ATCCCAGCTA CTTGGGCGGC TGAGACACAA GAACCACCTA AATGTGGGAG 120 

GGTCAAGGTT 6CATGAGTCA TGATCGCGCC ACTGCACTCC AGCCTGGGTG ACAGACTGAG 180 

ACCCTGCCTC AAAAGAAAAA GAATAGGAAG TTCAGAAACC CTGGGTGTGG NGCCCAGCAA 240 

TCTGCAHTA AACAATCCCT GCAGGCAATG CTGATGCAGC CTAAGTTCAA GAGCTGCTGT 300 
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TCTGGAGGCA GNAGTAAGGG CTTCCATCCA GCATCACGGN CAACACTGCA AAAGCACCTG 360 
TCCTCGTTGG TA 372 
(2) INFORMATION FOR SEQ 10 NO:27: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 477 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(0) TOPOLOGY: linear 



(x1) SEQUENCE DESCRIPTION: SEQ ID N0:27: 

TTCTGTCCAC ATCTACAAGT TTTATTTATT TTGTGGGTTT TCAGGGTGAC TAAGTTTTTC 60 

CCTACATTGA AAAGAGAAGT TGCTAAAAGG TGCACAG6AA ATCATTTTTT TAAGTGAATA 120 

TGATAATATG GGTCCGTGCT TAATACAACT GAGACATATT TGTTCTCTGT TTTTTTAGAG 180 

TCACCTCTTA AAGTCCAATC CCACAATGGT GAAAAAAAAA TAGAAA6TAT TTGTTCTACC 240 

TTTAAGGAGA CTGCAGGGAT TCTCCTTGAA AACGGAGTAT GGAATCAATC TTAAATAAAT 300 

ATGAAATTGG TTGGTCTTCT GGGATAAGAA ATTCCCAACT CAGTGTGCTG AAATTCACCT 360 

GACTTTTTTT GGGAAAAAAT AGTCGAAAAT GTCAATTTGG TCCATAAAAT ACATGTTACT 420 

ATTAAAAGAT ATTTAAAGAC AAATTCTTTC AGAGCTCTAA 6ATTGGTGTG GACAGAA 477 
(2) INFORMATION FOR SEQ ID N0:28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 438 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS ; single 
(0) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:28: 

TCTNCMCCT CTTGANTGTC AAAAACCTTN TAGGCTATCT CTAAAAGCTG ACTGGTATTC 60 

ATTCCAGCAA AATCCCTCTA GTTTHGGAG TTTCCHTTA CTATCTGGGG CTGCCTGAGC 120 

CACAAATCCC AAATTAAGAG CATGGCTATT TTCGGGGGCT GACAGGTCAA AAGGGGTGTA 180 

AATCCGATAA GCCTCCTGGA GGTGCTCTAA AAACACTCCT 6GTGACTCAT CAT6CCCCTG 240 

GACGACnCA ATCGNCTTAG ACAAGTTTAT AGGTHCTGG GCAGCTCCCT GAATACCCAC 300 

GAGGAGATAC CGGTGGAAAT CGTCAAAAGT TCTCCCTCCA CTTGAGAAAT TTGGGTCCCA 360 

ATTAG6TCCC AATTGGGTCT CTAATCACTA TTCCTCTAGC TTCCTCCTCC GGNCTATTGG 420 

TTGATGTGAG GTTGAAGA 438 
(2) INFORMATION FOR SEQ ID N0;29: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 620 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(0) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:29: 
AAGAGGGTAC CAGCCCCAAG CCTTGACAAC TTCCATAGGG TGTCAA6CCT GTGGGTGCAC 60 
AGAAGTCAAA AATTGAGTTT T6GGATCCTC AGCCTAGATT TCAGAGGATA TAAAGAAACA 120 
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CCTAACACCT AGATATTCAG ACAAAAGTTT ACTACAGGGA TGAAGCTTTC ACGGAAAACC 180 

TCTACTAGGA AAGTACAGAA GAGAAATGT6 GGTTTGGAGC CCCCAAACA6 AATCCCCTCT 240 

AGAACACTGC CTAATGAAAC TGTGAGAAGA TGGCCACTGT CATCCAGACA CCAGAATGAT 300 

AGACCCACCA AAAACTTATG CCATATTGCC TATAAAACCT ACAGACACTC AATGCCAGCC 360 

CCATGAAAAA AAAACTGAGA AGAAGACTGT NCCCTACAAT GCCACCGGAG CAGAACTGCC 420 

CCAGGCCATG GAAGCACAGC TCTTATATCA ATGTGACCTG GATGTTGAGA CATGGAATCC 480 

NANGAAATCN TTTTAANACT TCCACGGTTN AATGACTGCC CTATTANATT CNGAACTTAN 540 

ATCCNGGCCT GTGACCTCTT TGCTTTGGCC ATTCCCCCTT TTTGGAATG6 CTNTTTTTTT 600 

CCCATGCCTG TNCCCTCTTA 620 
(2) INFORMATION FOR SEQ 10 N0:30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 100 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:30: 
TTACAACGAG GGGGTCAATG TCATAAATGT CACAATAAAA CAATCTCTTC TTTTTHTTT 60 
TTTTTTTTTT TTTHTTTTT TTTTTTTHT TTTTTTTTTT 100 
(2) INFORMATION FOR SEQ ID NO: 31: 
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(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 762 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ 10 M0:31: 

TAGTCTATGC GCCGGACAGA 6CAGAATTAA ATTGGAAGTT GCCCTCCGGA CTTTCTACCC 60 

ACACTCTTCC TGAAAAGAGA AAGAAAAGAG GCAGGAAAGA GGTTAGGATT TCATTTTCAA 120 

GAGTCAGCTA ATTAG6AGAG CAGAGTTTAG ACAGCA6TAG GCACCCCATG ATACAAACCA 180 

TGGACAAAGT CCCTGTTTAG TAACTGCCAG ACATGATCCT GCTCAGGHT TGAAATCTCT 240 

CTGCCCATAA AAGATGGAGA GCAGGA6TGC CATCC«:ATC AACACGTGTC CAAGAAAGAG 300 

TCTCAGGGAG ACAAGGGTAT CAAAAAACAA GATTCTTAAT GGGAAQGAAA TCAAACCAAA 360 

AAATTAGATT TTTCTCTACA TATATATAAT ATACAGATAT TTAACACATT ATTCCAGAGG 420 

TGGCTCCAGT CCTTGGGGCT TGAGAGATGG TGAAAACTTT TGTTCCACAT TAACTTCTGC 480 

TCTCAAATTC TGAAGTATAT CAGAATGGGA CAGGCAATGT TTTGCTCCAC ACTGGGGCAC 540 

AGACCCAAAT GGTTCTGTGC CCGAAGAAGA GAAGCCCGAA AGACATGAAG GATGCTTAAG 600 

GGGGGTTGGG AAAGCCAAAT TGGTANTATC THTCCTCCT GCCTGTGTTC CNGAAGTCTC 660 

CNCTGAAGGA ATTCTTAAAA CCCTTTGTGA GGAAATGCCC CCHACCATG ACAANTGGTC 720 

CCATTGCTTT TAGGGNGATG GAAACACCAA GGGTHTGAT CC 762 
(2) INFORMATION FOR SEQ ID N0:32: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 276 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:32: 

TAGTCTATGC GTGTATTAAC CTCCCCTCCC TCAGTAACAA CCAAAGAGGC AGGAGCTGTT 60 

ATTACCAACC CCATTTTACA GATGCATCAA TAATGACAGA GAAGTGAAGT GACTTGCGCA 120 

CACAACCAGT AAATTGGCAG AGTCAGATTT GAATCCATGG AGTCTGGTCT GCACTTTCAA 180 

TCACCGAATA CCCTTTCTAA 6AAACGTGTG CTGAATGAGT GCATGGATAA ATCAGTGTCT 240 

ACTCAACATC TTTGCCTAGA TATCCCGCAT AGACTA 276 
(2) INFORMATION FOR SEQ ID N0:33: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 477 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(0) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:33: 
TAGTAGTTGC CAAATATTTG AAAATTTACC CAGAAGTGAT TGAAAACTH TTGGAAACAA 60 
AAACAAATAA AGCCAAAAGG TAAAATAAAA ATATCTTTGC ACTCTCGTTA TTACCTATCC 120 



ATAACTTTTT CACCGTAAGC TCTCCTGCTT GTTAGTGTAG TGTGGTTATA HAAACHTT 



180 
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TAGTTATIAT TTTTTATTCA CTTTTCCACT AGAAAGTCAT TATTGATTTA GCACACATGT 240 

TGATCTCAH TCATTnTTC TTTTTATAGG CAAAAHTGA TGCTATGCAA CAAAAATACT 300 

CAAGCCCATT ATCHTTTTC CCCCCGAAAT CTGAAAATTG CAGGGGACAG AGGGAAGHA 360 

TCCCATTAAA AAATTGTAAA TATGHCAGT TTATGTTTAA AAATGCACAA AACATAAGAA 420 

AATTGTGTTT ACTTGAGCTG CTGATTGTAA GCAGTTTTAT CTCAGG6GCA ACTACTA 477 
(2) INFORMATION FOR SEQ ID N0:34: 

(i) SEQUENCE CHARACTERISTICS. 

(A) LENGTH: 631 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID M0:34: 

TAGTAGTTGC CAATTCAGAT GATCAGAAAT GCTGCTTTCC TCAGCATTGT CTTGTTAAAC 60 

CGCATGCCAT TTGGAACTTT 6GCAGTGAGA AGCCAAAAGG AAGAG6TGAA TGACATATAT 120 

ATATATATAT ATTCAATGAA AGTAAAATGT ATATGCTCAT ATACHTCTA GTTATCAGAA 180 

TGAGTTAAGC TTTATGCCAT TGGGCTGCTG CATATHTAA TCAGAAGATA AAAGAAAATC 240 

TGGGCATDT TAGAATGTGA TACATGTHT TTTAAAACTG HAAATATTA TTTCGATATT 300 

TGTCTAAGAA CCGGAATGH CTTAAAATTT ACTAAAACAG TATTGTrTGA GGAAGAGAAA 360 

ACTGTACTGT TTGCCATTAT TACAGTCGTA CAAGTGCATG TCAAGTCACC CACTCTCTCA 420 
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GGCATCAGTA TCCACCTCAT AGCTTTACAC ATTTTGACGG GGAATATTGC AGCATCCTCA 480 

GGCCTGACAT CTGGGAAAGG CTCAGATCCA CCTACTGCTC CTTGCTCGTT GATTTGTTTT 540 

AAAATATTGT GCCTGGTGTC ACHTTAAGC CACAGCCCTG CCTAAAAGCC AGCAGAGAAC 600 

AGAACCCGCA CCATTCTATA GGCAACTACT A 631 
(2) INFORMATION FOR SEQ ID N0:35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 578 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(XI) SEQUENCE DESCRIPTION: SFQ ID N0:35: 

TAGTAGTTGC CATCCCATAT TACAGAAGGC TCTGTATACA TGACTTATTT GGAAGTGATC 60 

TGTTTTCTCT CCAAACCCAT TTATCGTAAT TTCACCAGTC TTGGATCAAT CTTGGTTTCC 120 

ACTGATACCA TGAAACCTAC TTGGAGCAGA CATTGCACAG TTTTCTGTGG TAAAAACTAA 180 

AGGTTTATTT GCTAAGCTGT CATCTTATGC TTAGTAHTT HTTTTACAG TGG6GAATTG 240 

CTGAGATTAC ATTTTGTTAT TCATTAGATA CTTTGGGATA ACTTGACACT GTCTTCTTTT 300 

TTTCGCTTTT AATTGCTATC ATCATGCTTT TGAAACAAGA ACACATTAGT CCTCAAGTAT 360 

TACATAAGCT TGCTTGTTAC GCCTGGT6GT TTAAA6GACT ATCTTTGGCC TCAGGTTCAC 420 

AAGAATGGGC AAAGTGTTTC CTTATGTTCT GTAGTTCTCA ATAAAAGATT GCCAGGGGCC 480 

GGGTACTGTG GCTCGCACTG TAATCCCA6C ACTTTGGGAA GCTGAGGCTG GCGGATCAT6 540 
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TTAGGGCAGG TGTTCGAAAC CA6CCTGG6C AACTACTA 



578 



(2) INFORMATION FOR SEQ ID N0:36: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 583 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 



TAGTAGTTGC CTGTAATCCC AGCAACTCAG GAGGCTGGGG CAGGAGAATC A6TTGAACCT 60 



GG6AGGCAGA AGTTGTAATT AGCAAAGATC GCACCATTGC ACTTCAGCCT GGGCAACAAG 120 



AGTGAGATTC CATCTCAAAA ACAAAAAAAA GAAAAAGAAA AGAAAAGGAA AAAACGTATA 180 



AACCCAGCCA AAACAAAATG ATCATTCTH TAATAAGCAA GACTAATTTA AT6TGTTTAT 240 



TTAATCAAAG CAGTTGAATC TTCTGAGHA TTGGTGAAAA TACCCATGTA GTTAATTTAG 300 



GGITCHACT TGGGTGAACG TTTGATGTTC ACAGGTTATA AAATGGTTAA CAAGGAAAAT 360 



GATGCATAAA GAATCTTATA AACTACTAAA AATAAATAAA ATATAAATGG ATA6GT6CTA 420 



TGGATGGAGT TTTTGT6TAA TTTAAAATCT TGAAGTCATT TTGGATGCTC ATTGGTT6TC 480 



TGGTAATTTC CATTAGGAAA AGGTTATGAT ATGGGGAAAC TGTTTCTG6A AATTGCGGAA 540 



TGTTTCTCAT CTGTAAAATG CTAGTATCTC AGGGCAACTA CTA 



583 



(2) INFORMATION FOR SEQ ID N0:37: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 716 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:37: 
GATCTACTAG TCATNTGGAT TCTATCCATG GCAGCTAAGC CTTTCTGAAT GGATTCTACT 60 

GCTTTCTTGT TCTTTAATCC AGACCCTTAT ATATGHTAT GTTCACAGGC AGGGCAATGl 120 

TTAGTGAAAA CAATTCIAAA TTTTTTAHT TGCATTTTCA TGCTAATTTC CGTCACACTC 180 

CAGCAGGCTT CCTGGGAGAA TAAGGAGAAA TACAGCTAAA GACATTGTCC CTGCTTACTT 240 

ACAGCCTAAT GGTATGCAAA ACCACTTCAA TAAA6TAACA GGAAAAGTAC TAACCAGGTA 300 

GAATGGACCA AAACTGATAT AGAAAAATCA GAGGAAGAGA GGAACAAATA TTTACTGAGT 360 

CCTA6AATGT ACAAGGCTTT TTAATTACAT ATTTTATGTA AGGCCTGCAA AAAACAGGTG 420 

AGTAATCAAC ATTTGTCCCA THTACATAl AAGGAAACTG AAGCTTAAAT TGAATAATTT 480 

AATGCATAGA TTTTATAGTT AGACCATGTT CAGGTCCCTA T6TTATACTT ACTAGCTGTA 540 

TGAATATGAG AAAATAATTT TGTTAHTTC TTGGCATCAG TATTTTCATC TGCAAAATAA 600 

AGCTAAAGTT ATTTAGCAAA CAGTCAGCAT AGTGCCTGAT ACATAGTAGG TGCTCCAAAC 660 

ATGATTACNC TANTAHNGG TATTANAAAA ATCCAATATA GGCNTGGATA AAACCG 716 
(2) INFORMATION FOR SEQ ID NO: 38: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 688 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:38: 

TTCTGTCCAC ATATCATCCC ACTTTAATTG TTAATCAGCA AAACTTTCAA TGAAAAATCA 60 

TCCATTTTAA CCA6GATCAC ACCAG6AAAC TGAAGGTGTA TTTTHTTTA CCTTAAAAAA 120 

AAAAAAAAAA ACCAAACAAA CCAAAACAGA TTAACAGCAA AGA6TTCTAA AAAATTTACA ISO 

TTTCTCTTAC AACTGTCATT CAGAGAACAA TAGTTCTTAA GTCTGTTAAA TCTTGGCATT 240 

AACAGAGAAA CTTGATGAAN AGTTGTACH GGAATATTGT GGATTTTTTT TTTTGTCTAA 300 

TCTCCCCCTA TTGTHTGCC AACAGTAATT TAAGTTTGTG TGGAACATCC CCGTAGTTGA 360 

AGTGTAAACA ATGTATAGGA AGGAATATAT GATAAGAT6A TGCATCACAT ATGCATTACA 420 

TGTAGGGACC TTCACAACTT CATGCACTCA GAAAACATGC TTGAAGAGGA GGAGAGGACG 480 

GCCCAG6GTC ACCATCCAGG TGCCTTGAGG ACA6A6AATG CAGAAGTGGC ACTGTTGAAA 540 

TTTAGAAGAC CATGTGTGAA TGGHTCAGG CCTGGGATGT HGCCACCAA GAAGTGCCTC 600 

CGAGAAATTT CHTCCCATT TGGAATACAG G6TGGCTTGA TGGGTACGGT GGGTGACCCA 660 

ACGAAGAAAA TGAAAHCTG CCCTTTCC 688 

(2) INFORMATION FOR SEQ 10 N0:39: 

(1) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 585 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:39: 

TAGTAGTTGC CGCNNACCTA AAANTTGGAA AGCATGATGT CTAGGAAACA TANTAAAATA 60 

GGGTATGCCT AT6TGCTACA GAGAGATGTT AGCATTTAAA GTGCATANTT TTATGTATTT 120 

TGACAAATGC ATATNCCTCT ATAATCCACA ACTGATTACG AAGCTATTAC AATTAAAAAG 180 

TTTGGCCGGG CGTGGTGGGC GGTGGCT6AC GCCTGTAATC CCAGCACTTT GGGAGGCCGA 240 

GGCACGCGGA TCACGAGGTC GGGA6TTCAA GACCATCCTG GCTAACACGG TGAAAGTCCA 300 

TCTCTACTAA AAATACGAAA AAATTACCCC GGCGTGGTGG CGGGCGCCTG TAGTCCCAGC 360 

TACTCCGGAG GCTGAGGCA6 GAGAATGGCG TGAACCCAGG ACACGGAGCT TGCAGTGTGC 420 

CAACATCACG TCACTGCCCT CCAQCCTGGG GGACAGGAAC AAGANTCCCG TCCTCANAAA 480 

AGAAAAATAC TACTNATANT TTCNACTTTA TTTTAANTTA CACAGAACTN CCTCTTGGIA 540 

CCCCCTTACC ATTCATCTCA CCCACCTCCT ATAGGGCACN NCTAA 585 
(2) INFORMATION FOR SEQ ID N0:40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 475 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID N0:40: 

TCTGTCCACA CCAATCTTAG AAGCTCTGAA AAGAAHTGT CTTTAMTAT CTTTTAATAG 60 

TAACATGTAT TTTATGGACC AAATTGACAT TTTCGACTGT mTTCCAAA AAAGTCAGGT 120 

GAATTTCAGC ACACTGAGTT GGGAATTTCT TATCCCAGAA GACCAACCAA TTTCATAHT 180 

ATTTAAGATT GATTCCATAC TCCGTTTTCA A6GAGAATCC CTGCAGTCTC CTTAAAGGTA 240 

GAACAAATAC TTCCTATTTT TTTTTCACCA TTGTGGGATT GGACTTTAAG AGGTGACTCT 300 

AAAAAAACAG AGAACAAATA TGTCTCAGTT GTATTAAGCA CGGACCCATA TTATCATATT 360 

CACTTAAAAA AATGATTTCC TGTGCACCTT TTGGCAACTT CTCTTTTCAA TGTAGGGAAA 420 

AACTTAGTCA CCCTGAAAAC CCACAAAATA AATAAAACTT GTAGATGTGG ACAGA 475 
(2) INFORMATION FOR SEQ ID N0:41: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 423 base pairs 

(B) TYPE: nucleic add 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:41: 

TAAGAGGGTA CATC6GGTAA GAACGTAGGC ACATCTAGAG CHAGAGAAG TCTGGGGTAG 60 

GAAAAAAATC TAA6TATTTA TAAGGGTATA GGTAACATTT AAAAGTAGGG CTAGCTGACA 120 

TTATTTAGAA AGAACACATA CGGAGAGATA AGGGCAAAGG ACTAAGACCA GA6GAACACT 180 

AATATTTAGT GATCACTTCC ATTCTTGGTA AAAATAGTAA CTTTTAAGTT AGCTTCAAGG 240 
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AAGATTTHG GCCATGATTA GTT6TCAAAA GTTAGTTCTC TTGGGTTTAT ATTACTAATT 300 

TIGTHTAAG ATCCTT6TTA GTGCTTTAAT AAAGTCATGT TATATCAAAC GCTCTAAAAC 360 

ATTGTAGCAT 6TTAAATGTC ACAATATACT TACCATTTGT TGTATATGGC TGTACCCTCT 420 
CTA 423 
(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 527 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: linear 

(x1) SEQUENCE DESCRIPTION: SEQ ID N0:42: 

TCTCCTAGGC TAATGTGTGT GTTTCTGTAA AAGTAAAAAG TTAAAAATH TAAAAATAGA 60 

AAAAAGCTTA TAGAATAAGA ATATGAAGAA AGAAAATATT THGTACATT TGCACAATGA 120 

GTTTATGTTT TAAGCTAAGT 6TTATTACAA AAGAGCCAAA AAGGTTTTAA AAATTAAAAC 180 

GTTT6TAAAG TTACAGTACC CTTATGTTAA TTTATAATTG AAGAAA6AAA AACTTTTTTT 240 

TATAAATGTA GTGTAGCCTA AGCATACAGT ATTTATAAAG TCTGGCAGTG TTCAATAATG 300 

TCCTAGGCCT TCACATTCAC TCACTGACTC ACCCAGAGCA ACTTCCAGTC CTGTAAGCTC 360 

CATTCGTGGT AAGTGCCCTA TACAGGTGCA CCATTTATTT TACAGTAHT TTACTGTACC 420 

TTCTCTATGT TTCCATAT6T TTCGATATAC AAATACCACT GGTTACTATN GCCCNACAGG 480 
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TAATTCCA61 AACACGGCCT GTATACGTCT GGTANCCCTA GNGAAGA 527 
(2) INFORMATION FOR SEQ ID N0:43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 331 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEONESS: single 

(D) TOPOLOGY: linear 



(x1) SEQUENCE DESCRIPTION: SEQ ID N0:43: 

TCTTCAACCT CGTAGGACAA CTCTCATATG CCTGGGCACT ATTTTTAGGT TACTACCTTG 60 

GCTGCCCTTC TTTAAGAAAA AAAAAAGAAG AAAAAAGAAC TTHCCACAA GTTTCTCTTC 120 

CTCTAGTTGG AAAATTAGAG AAATCATGn TTTAATTTT6 TGTTATTTCA GATCACAAAT 180 

TCAAACACTT GTAAACATTA AGCHCTGH CAATCCCCTG GGAAGAGGAT TCATTCTGAT 240 

ATTTACGGTT CAAAAGAAGT TGTAATATTG TGCTTGGAAC ACAGA6AACC AGTTATTAAC 300 

TTCCTACTAC TATTATATAA TAAATAATAA C 331 
(2) INFORMATION FOR SEQ ID N0:44: 

( i ) SEQUENCE CHARACTER I ST ICS : 

(A) LENGTH: 592 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEONESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:44: 
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GGCTTAGTAG TTGCCAGGCA AAATARCGTT GATTCTCCTC AGGAGCCACC CCCAACACCC 60 

CTGTTTGCTT CTAGACCTAT ACCTAGACTA AAGTCCCAGC AGACCCCTAG AGGTGAGGTT 120 

CAGAGTGACC CTTGA66AGA TGTGCTACAC TAGAAAAGAA CTGCTTGAGT TTTCTAATTT 180 

ATATAAGCAG AAATCTGGAG AAGAGTCATA GGAATGGATA TTAAGGGTGT GAGATAATGG 240 

CGGAA6GAAT ATAGAGTTGG ATCAGGCTGG ACTTATTGAT TTGAACCCAC TAAGTAGAGA 300 

TTCTGCTTTT GATGTTGCAG CTCAGGGAGT TAAAAAAGGT TTTAAIGGH CTAATAGTTT 360 

ATTTGCTTGG TTAGCTGAAA TATGGATAAA AGATGGCCCA CTGTGAGCAA GCTGGAAATG 420 

CCTGATCTCT CTCAGTTTAA TGTAGAGGAA GGGATCCAAA AGTTTAGGGA GANTTGGATG 480 

CTGGRAKTGG ATTGGTCACT TTGRGACCTA CCCWTCCCAG CTGGGAGGGT CCAGAAGATA 540 

CACCCTTGAC CAACGCTTTG CGAAATGGAT TTGTGATGGC GGCAACTACT AA 592 

(2) INFORMATION FOR SEO ID N0:45: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 567 base pairs 
CB) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(x1) SEQUENCE DESCRIPTION: SEO ID N0:45: 

GGCTTAGTAG TTGCCATTGC GAGTGCTTGC TCAACGAGCG TTGAACATGG CGGATTGTCT 60 

A6ATTCAACG GATTTGAGTT TTACCAGCAA AGCGAACCAA GCGCGGCCCA GA6AATTATG 120 

GGTTGGTTGG CTTTGAAAAG ATGGAAATCC TGTAGGCCTA GTCAGAAAAG CCTTCTTGCA 180 
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GAACAGTTGG TTCTCGGGCG AACGCTCATC AAGATGCCCA TTGGAAAGGC TAGCGTGTAT 240 

TTGGGAGAGC CTGATAGCGT GTCTTCTGAT GATGTTTGTG CTTGGACAGT GACAAAAGAT 300 

ATGCAAAGCA AGTCCGAACT AGACGTCAAG CTTCGTGAGC AAATTATTGT AGACTCCTAC 360 

TTATACTGTG AG6AATGATA GCCAAGGGTG GGGACTTTAA GACTAAGGTG GTTTGTACTT 420 

GCGCCGATGA TCCCAGGCAG AAAGAMCTGA TCGCTAGTTT TATACGGGCA ACTACTAAGC 480 

CGAATTCCAG CACACTGGCG GCCGTTACTA ATTGGATCCG ANCTCGGTAC CAGCHGATG 540 

CATASCTTGA GTTWTCTATA NTGTCNC 567 
(2) INFORMATION FOR SEQ ID N0:46: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 908 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:46: 

6AGCGAAAGA CCGAGGGCAG NGNNTANGNG CGANGAAGCG GAGAGGGCCA AAAAGCAACC 60 

GCTTTCCCCG GGGGGTGCCG ATTCAHAAG GCAGGTGGAG GACAGGTTTC CCGATGGAAG 120 

GCGGCAGGGG CGCAA6CAAT TAATGTGAGT AGGCCATTCA TTAGCACCCG GGCHAACAT 180 

TTAAGCTTCG GGTTGGTATG TGGTGGGAAT TGTGAGCGGA TAACAATTTC ACACAGGAAA 240 

CAGCTATGAC CATGATTACG CCAAGCTAH TAGGTGACAT TATAGAATAA CTCAAGHAT 300 
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GCATCMGCT TGGTACCGAG TTCGGATCCA CTAGTAACGG CCGCCAGT6T GTGGAATTCG 
GCTTAGTAGT TGCCGACCAT GGAGTGCTAC CTAGGCTAGA ATACCTGAGY TCCTCCCTAG 
CCTCACTCAC ATTAAATTGT ATCTTTTCTA CATTAGATGT CCTCAGCGCC TTATTTCTGC 
TGGACWATCG ATAAATTAAT CCTGATAGGA TGATAGCAGC AGATTAATTA CTGAGAGTAT 
GTTAATGTGT CATCCCTCCT ATATAACGTA TTTGCATHT AATG6AGCAA TTCTGGAGAT 
AATCCCT6AA GGCAAA6GAA TGAATCTTGA GGGTGAGAAA GCCAGAATCA GTGTCCAGCT 
GCAGTTGTGG GAGAAGGT6A TATTATGTAT GTCTCAGAAG TGACACCATA TGGGCAACTA 
CTAAGCCCGA ATTCCAGCAC ACTGGCGGGC GTTACTAATG GATCCGAGCT CGGTACCAAG 
CTTGATGCAT AGCTTGAGTA TCTATAGTGT CACTAAATAG CCTGGCGTTA TCATGGTCAT 
AGCTGTTTCC TGTGTGAAAT TGTTATCC6C TCCCAATTCC CCCCACCATA CGAGCCGGAA 
CATAAAGT 

(2) INFORMATION FOR SEQ ID N0:47: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 480 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(0) TOPOLOGY: linear 

(x1) SEQUENCE DESCRIPTION: SEQ ID N0:47: 
TGCCAACAAG GAAAGHTTA AATTTCCCCT TGAGGATTCT TGGTGATCAT CAAATTCAGT 
GGHTTTAAG GTTGTHTCT GTCAAATAAC TCTAACHTA AGCCAAACAG TATATGGAAG 
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CACAGATAKA ATATTACACA GATAAAAGAG GAGTT6ATCT AAAGTARAGA TAGTTGGGGG 180 

CmAAHTC TGGAACCTAG GTCTCCCCAT CTTCTTCTGT GCTGAQGAAC TTCTTGGAAG 240 

CGGGGATTCT AAAGHCnT GGAAGACAGT TTGAAAACCA CCATGTTGTT CTCAGTACCT 300 

TTATTTTTAA AAAGTAGGTG AACATTTTGA GAGAGAAAAG GGCTTG6TTG AGATGAAGTC 360 

CCCCCCCCCC CnTTTTTTT TTTTAGCTGA AATAGATACC CTATGnNAA RGAARGGATT 420 

ATTATHACC ATGCCAYTAR SCACATGCTC TTTGATGGGC NYCTCCSTAC CCTCCHAAG 480 

(2) INFORMATION FOR SEQ ID N0:48: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 591 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ 10 NO: 48: 

AAGAGGGTAC CGAGTGGAAT TTCCGCTTCA CTAGTCTGGT GTGGCTAGTC GGTTTCGTGG 60 

TGGCCAACAT TACGAACTTC CAACTCAACC GHCnCGAC GHCAAGCGG 6AGTACCGGC 120 

GAGGATGGTG GCGTGAATTC TGGCCTTTCT TTGCCGTGGG ATCGGTAGCC GCCATCATCG 180 

GTATGTTTAT CAAGATCTTC TTTACTAACC CGACCTCTCC GATTTACCTG CCCGAGCCGT 240 

GGTTTAACGA G6GGAGGGGG ATCCAGTCAC GCGAGTACTG GTCCCAGATC HCGCCATCG 300 

TCGT6ACAAT 6CCTATCAAC TTCGTCGTCA ATAAGTTGTG GACCHCCGA ACGGTGAAGC 360 
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ACTCCGAAAA CGTCCQGTGG CTGCTGTGCG GTGACTCCCA AAATCTTGAT AACAACAAGG 420 

TAACCGAATC GCGCTAAGGA ACCCCGGCAT CTCGGGTACT CTGCATATGC GTACCCCTTA 480 

AGCCGAATTC CAGCACACTG GCGGCCGTTA CTAATTGGAT CCGAACTCCG TAACCAAGCC 540 

TGATGC6TAA CTT6AGTTAT TCTATAGTGT CCCTAAAATA ACCTGGCGTT A 591 
(2) INFORMATION FOR SEQ ID N0:49: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 454 base pairs 

(B) TYPE: nucleic add 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:49: 

AAGAGGGTAC CTGCCTTGAA ATTTAAATGT CTAAGGAAAR TGGGAGATGA TTAAGAGHG 60 

GTGTGGCYTA GTCACACCAA AATGTATTTA TTACATCCTG CTCCTTTCTA GTTGACAGGA 120 

AAGAAAGCTG CTGTGG6GAA AGGAGGGATA AATACTGAAG GGATTTACTA AACAAATGTC 180 

CATCACAGAG TTTTCCTTTT TTTTTTTTTG AGACAGAGTC TTGCTCTGTC ACCCAGGCTG 240 

GAATGAAGWG GTATGATCTC AGTTGAATGC AACCTCTACC TCCTAGGTTC AAGCGATTCT 300 

CATGCCTCAG CCTCCTGAGC AGCTGGGACT ATAGGCGCAT GCTACCATGC CAGGCTAATT 360 

TTTATATTTT TATTAGAGAC GGGGTGTTGC CATGTTGGCC AGGCAGGTCT CGAACTCCTG' 420 

GGCCTCAGAT GATCTGCCCC ACCGTACCCT CTTA 454 
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(2) INFORMATION FOR SEQ 10 N0:50; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 463 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEONESS: Single 

(D) TOPOLOGY: linear 



(x1) SEQUENCE DESCRIPTION: SEQ ID N0:50: 

AA6AGGGTAC CAAAAAAAAG AAAAAGGAAA AAAAGAAAAA CAACTTGTAl AAGGCTTTCT 60 

GCTGCATACA GCTTTTHTT TTTAAATAAA TGGTGCCAAC AAATGTTTTT GCATTCACAC 120 

CAATTGCTGG TTTTGAAATC GTACTCTTCA AAGGTAHTG TGCAGATCAA TCCAATAGTG 180 

ATGCCCCGTA GGTTTTGTG6 ACTGCCCACG TTGTCTACCT TCTCATGTAG GAGCCATTGA 240 

GAGACTGTH GGACATGCCT GTGTTCATGT AGCCGTGATG TCCGG6GGCC GTGTACATCA 300 

TGTTACCGTG GGGTGGGGTC TGCATTGGCT GCTGGGCATA TGGCTGGGTG CCCATCATGC 360 

CCATCTGCAT CTGCATAGGG TATTGGGGCG TTTGATCCAT ATAGCCAT6A TTGCTGTGGT 420 

AGCCACTGTT CATCATTGGC TGGGACATGC TGTTACCCTC TTA 463 
(2) INFORMATION FOR SEQ ID N0:51: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 399 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEONESS: Single 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEO 10 NO: 51: 

CTTCAACCTC CCAAAGTGCT GGGATTACAG GACTGAGCCA CCACGCTCAG CCTAAGCCTC 60 

TTTTTCACTA CCCTCTAAGC GATCTACCAC AGTGATGAGG 6GCTAAAGAG CAGTGCAATT 120 

TGATTACAAT AATGGAACTT AGATTTATTA ATTAACAAH TTTCCTTAGC ATCTIGGHC 180 

CATAATTATT AAGAGTATGG ACTTACTTAG AAATGAGCH TCAHTTAAG AATTTCATCT 240 

TTGACCTTCT CTAHAGTCT GAGCAGTATG ACACTATACG TAnTTATTT AACTAACCTA 300 

CCnGAGCTA TTACTTHTA AAAGGCTATA TACATGAAT6 TGTATTGTCA ACTGTAAAGC 360 

CCCACAGTAT TTAATTAIAT CATGATGTCT TTGAGGTTG 399 
(2) INFORMATION FOR SEQ ID N0:52: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 392 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:52; 

CTTCAACCTC AATCAACCTT GGTAATTGAT AAAATCATCA CTTAACTTTC T6ATATAATG 60 

GCAATAATTA TCTGAGAAAA AAAAGTGGTG AAAGATTAAA CTTGCAHTC TCTCAGAATC 120 

TTGAAGGATA HTGAATAAT TCAAAAGCGG AATCAGTAGT ATCAGCCGAA GAAACTCACT 180 

TAGCTAGAAC GTTGGACCCA TGGATCTAAG TCCHGCCCT TCCACTAACC AGCTGATTGG 240 

TTHGTGTAA ACCTCCTACA CGCTTGGGCT TGGTCGCCTC ATTT6TCAAA GTAAAGGCTG 300 
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AAATAGGAAG ATAATGAACC GTGTCTTTTT GGTCTCTTTT CCATCCATTA CTCTGATTTT 360 
ACAAAGAGGC CTGTATTCCC CTGGT6AGGT TG 392 
(2) INFORMATION FOR SEQ 10 N0:53: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 179 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: linear 



(XI) SEQUENCE DESCRIPTION: SEQ ID N0:53: 
TTCGGGTGAT GCCTCCTCAG GCTACAGTGA AGACTGGATT ACAGAAAGGT GCCAGCGAGA 60 
TTTCAGATTC CTGTAAACCT CTAAAGAAAA GGAGTCGCGC CTCAACTGAT GTAGAAATGA 120 
CTAGTTCAGC ATACNGAGAC ACNTCTGACT CCGATTCTAG AGGACTGAGT GACCTGCAN 179 
(2) INFORMATION FOR SEQ ID N0:54: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 112 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(0) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:54: 
TTCGGGTGAT GCCTCCTCAG GCTACATCAT NATAGAAGCA AAGTAGAANA ATCNNGTTTG 60 



TGCATTTTCC CACANACAAA ATTCAAATGA NTGGAAGAAA TTGGGANAGT AT 



112 
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(2) INFORMATION FOR SEO 10 N0:55: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 225 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(x1) SEQUENCE DESCRIPTION: SEQ ID N0:55: 

TGAGCTTCCG CTTCTGACAA CTCAATAGAT AATCAAA6GA CAACHTAAC AGGGATTCAC 60 

AAAGGAGTAT ATCCAAATGC CAATAAACAT ATAAAAAGGA ATTCAGCTTC ATCATCATCA 120 

GAAGWATGCA AATTAAAACC ATAATGAGAA ACCACTATGT CCCACTAGAA TAGATAAAAT 180 

CTTAAAAGAC TG6TAAAACC AAGTGTTGGT AAGGCAAGAG GAGCA 225 
(2) INFORMATION FOR SEQ ID N0:56: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 175 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:56: 

GCTCCTCTTG CCTTACCAAC ACATTCTCAA AAACCTGTTA GAGTCCTAAG CATTCTCCTG 60 

TTAGTATTGG GATTTTACCC CTGTCCTATA AAGATGTTAT GTACCAAAAA TGAAGTGGAG 120 

GGCCATACCC'TGAGGGAGGG GAGGGATCTC TAGTGTTGTC AGAAGC6GAA GCTCA 175 
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(2) INFORMATION FOR SEQ ID N0:67: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 223 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: Single 
(0) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:57: 

AGCCATTTAC CACCCATGGA TGAATGGATT TTGTAATTCT AGCTGTTGTA TTTTGTGAAT 60 

TTGTTAATTT TGTTGTTTTT CTGTGAAACA CATACATTGG ATATGGGAGG TAAAGGAGTG 120 

TCCCAGTTGC TCCTGGTCAC TCCCTTTATA GCCATTACT6 TCTTGTTTCT TGTAACTCAG 180 

GTTAGGTTTT GGTCTCTCTT GCTCCACTGC AAAAAAAAAA AAA 223 
(2) INFORMATION FOR SEQ ID N0:58: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 211 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(0) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:58: 

GTTCGAAGGT GAACGTGTAG GTAGCGGATC TCACAACTGG GGAACTGTCA AAGACGAATT 60 

AACTGACTTG GATCAATCAA ATGTGACTGA GGAAACACCT GAAGGTGAAG AACATCATCC 120 

AGTGGCAGAC ACTGAAAATA A6GAGAATGA AGTTGAAGAG GTAAAAGAGG AGGGTCCAAA 180 
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AGAGATGACi TTGGATGG6T GGTAAATGGC T 211 
(2) INFORMATION FOR SEQ ID N0:59: 

(1) StOUENCE CHARACTERISTICS: 

(A) LENGTH: 208 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:59: 

GCTCCTCTTG CCTTACCAAC TTTGCACCCA TCATCAACCA TGTGGCCAGG TTTGCAGCCC 60 

AGGCTGCACA TCAGGGGACT GCCTCGCAAT ACTTCATGCT GTTGCTGCTG ACTGATGGTG 120 

CTGTGACGGA TGTGGAAGCC ACACGT6AGG CTGTGGTGCG TGCCTCGAAC CT6CCCATGT 180 

CAGTGATCAT TATGGGTGGT AAATGGCT 208 
(2) INFORMATION FOR SEQ ID NO: 60: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 171 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(XI) SEQUENCE DESCRIPTION: SEQ ID N0:60: 
AGCCATTTAC CACCCATACT AAATTCTAGT TCAAACTCCA ACTTCTTCCA TAAAACATCT 60 
AACCACTGAC ACCAGTTGGC AATAGCTTCT TCCTTCTTTA ACCTCTTAGA GTATTTATGG 120 
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TCAAT6CCAC ACATTTCTGC AACTGAATAA AGTTGGTAAG GCAAGAGGAG C 171 
(2) INFORMATION FOR SEQ ID N0:61: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 134 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(XI) SEQUENCE DESCRIPTION: SEQ ID N0:61: 
CG6GTGATGC CTCCTCAGGC TTTGGTGTGT CCACTCNACT CACTGGCCTC TTCTCCAGCA 60 
ACTGGTGAAN ATGTCCTCAN GAAAANCNCC ACACGCNGCT CAGGGTGGGG TGGGAANCAT 120 
CANAATCATC N6GC 134 
(2) INFORMATION FOR SEQ ID N0:62: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 145 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:62: 
AGAGGGTACA TATGCAACAG TATATAAAGG AAGAAGTGCA CTGAGAGGAA CTTCATCAAG 60 
GCCATTTAAT CAATAAGTGA TA6AGTCAAG GCTCAACCCA GGTGTGACGG ATTCCAGGTC 120 



CCAAGCTCCT TACTGGTACC CTCTT 



145 
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(2) INFORMATION FOR SEQ ID N0:63: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 297 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(0) TOPOLOGY: linear 



(x1) SEQUENCE DESCRIPTION: SEQ ID N0:63: 

TGCACTGAGA GGAATTCAAA 6GGTTTATGC CAAAGAACAA ACCAGTCCTC TGCAGCCTAA 60 

CTCATTTGTT TTTGGGCTGC GAAGCCATGT AGA6GGCGAT CAGGCAGTAG ATGGTCCCTC 120 

CCACAGTCAG CGCCATGGTG GTCCGGTAAA GCATTTGGTC AGGCAGGCCT CGTTTCAGGT 180 

AGACGGGCAC ACATCAGCTT TCTGGAAAAA CTTTTGTAGC TCTGGAGCTT TGTTTTTCCC 240 

AGCATAATCA TACACTGTGG AATCGGAGGT CAGTTTAGTT GGTAAGGCAA GAGGAGC 297 
(2) INFORMATION FOR SEQ ID N0:64: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 300 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64: 
GCACTGAGAG GAACTTCCAA TACTATGHG AATAGGAGTG GTGAGA6AGG GCATCCTTGT 60 
CTTGTGCCGG TTTTCAAAGG GAATGCTTCC AGCTHTGCC CATTCAGTAT AATATTAAAG 120 
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AATGTTTTAC CATTTTCTGT CTTGCCTGTT TTTCTGTGTT TTTGTT6GTC TCTTCATTCT 180 
CCATTHTAG GCCTTTACAT GTTAGGAATA TATTTCnn AATGATACTT CACCTTTGGT 240 
ATCTTTTGTG AGACTCTACT CATAGTGTGA TAAGCACTGG GTTGGTAA6G CAAGAGGAGC 300 



(2) INFORMATION FOR SEQ ID N0:65: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 203 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS; single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:65: 

GCTCCTCTTG CCTTACCAAC TCACCCA6TA TGTCAGCAAT TTTATCRGCT TTACCTACGA 60 

AACAGCCTGT ATCCAAACAC TTAACACACT CACCTGAAAA GTTCAGGCAA CAATC6CCTT 120 

CTCATGGGTC TCTCTGCTCC AGTTCTGAAC CTTTCTCTTT TCCTAGAACA TGCATTTARG 180 

TCGATAGAAG TTCCTCTCAG TGC 203 
(2) INFORMATION FOR SEQ ID N0:66: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 344 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear- 
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(xi) SEQUENCE DESCRIPTION: SEQ ID N0:66: 

TACGGGGACC CCTGCATTGA GAAAGCGAGA CTCACTCTGA AGCTGAAATG CTGTTGCCCT 60 

TGCAGTGCTG GTAGCAGGAG TTCTGTGCTT TGTGGGCTAA GGCTCCT6GA TGACCCCTGA 120 

CATGGAGAAG GCAGAGTTGT GTGCCCCTTC TCATGGCCTC GTCAAGGCAT CATGGACTGC 180 

CACACACAAA ATGCCGTTTT TATTAACGAC ATGAAATTGA AGGAGAGAAC ACAATTCACT 240 

GATGTGGCTC GTAACCATGG ATATGGTCAC ATACAGAGGT GTGATTATGT AAAGGTTAAT 300 

TCCACCCACC TCATGTGGAA ACTAGCCTCA ATGCAGGGGT CCCA 344 
(2) INFORMATION FOR SEQ ID NO: 67: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 157 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

Cxi) SEQUENCE DESCRIPTION: SEQ ID N0:67: 
GCACTGAGAG GAACTTCGTA GGGAGGTTGA ACTGGCTGCT GAGGAGGGGG AACAACAGGG 60 
TAACCAGACT GATAGCCATT GGAIGGATAA TATGGTGGTT GAGGAGGGAC ACTACTTATA 120 
GCAGAGGGTT GTGTATAGCC TGAGGAGGCA TCACCCG 157 
(2) INFORMATION FOR SEQ ID N0:68: 



(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 137 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:68: 
GCACTGAGAG GAACTTCTAG AAAGTGAAAG TCTAGACATA AAATAAAATA AAAATTTAAA 60 
ACTCAGGAGA GACAGCCCAG CACGGTGGCT CACGCCTGTA ATCCCAGAAC TTTGGGAGCC 120 
TGAGGAGGCA TCACCCG 137 
(2) INFORMATION FOR SEO ID N0:69: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 137 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(0) TOPOLOGY; linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:69: 
CGGGTGATGC CTCCTCAG6C TGTATTTTGA AGACTATCGA CTGGACTTCT TATCAACTGA 60 
A6AATCCGTT AAAAATACCA GTTGTATTAT TTCTACCTGT CAAAATCCAT TTCAAATGTT 120 
GAAGTTCCTC TCAGTGC 137 
(2) INFORMATION FOR SEQ ID NO: 70: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 220 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID N0:70: 

A6CATGTTGA GCCCAGACAC GCAATCTGAA T6AGTGTGCA CCTCAAGTAA ATGTCTACAC 60 

GCT6CCTGGT CTGACATGGC ACACCATCNC GTGGAGGGCA CASCTCTGCT CNGCCTACWA 120 

CGAGGGCANT CTCATWGACA GGTTCCACCC ACCAAACTGC AAGAGGCTCA NNAAGTACTR 180 

CCAGGGTMYA SGGACMASGG TGGGAYTYCA YCACWCATCT 220 
(2) INFORMATION FOR SEQ ID N0:71: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 353 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(0) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:71: 

CGTTAGGGTC TCTATCCACT GCTAAACCAT ACACCTGGGT AAACAGGGAC CATHAACAT 60 

TCCCANCTAA ATATGCCAAG TGACTTCACA TGTTTATCTT AAAGATGTCC AAAACGCAAC 120 

TGATTTTCTC CCCTAAACCT GTGATGGTGG GATGAHAAN CCTGAGTGGT CTACA6CAAG 180 

TTAAGTGCAA G6TGCTAAA1 GAANGTGACC TGAGATACAG CATCTACAAG GCAGTACCTC 240 

TCAACNCAGG GCAACTTTGC TTCTCANAGG GCATTTAGCA 6TGTCTGAAG TAATTTCT6T 300 

ATTACAACTC ACGGGGCGGG GGGTGAATAT CTANTGGANA GNAGACCCTA ACG 353 
(2) INFORMATION FOR SEQ ID NO: 72: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 343 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(0) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:72: 

GCACTGAGAG GAACTTCCAA TACYATKATC AGAGT6AACA RGCARCCYAC AGAACAGGAG 60 

AAAATGTTYG CAATCTCTCC ATCTGACAAA AGGCTAATAT CCAGAWTCTA AWAGGAACTl 120 

AAACAAATTT ATGAGAAAAG AACARACAAC CTCAWCAAAA AGTG6GTGAA GGAWATGCTS 180 

AAARGAAGAC ATYTATTCA6 CCAGTAAACA YATGAAAAAA AGGCTCATSA TCACTGAWCA 240 

TTA6AGAAAT GCAAATCAAA ACCACAATGA GATACCATCT YAYRCCAGTT AGAAYGGTGA 300 

TCATTAAAAR STCA6GAAAC AACA6ATGCT GGACAAGGTG TCA 343 
(2) INFORMATION FOR SEQ ID N0:73: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 321 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xj) SEQUENCE DESCRIPTION: SEQ ID N0:73: 
GCACTGAGAG GAACTTCAGA GAGAGAGAGA GAGTTCCACC CT6TACTTGG GGAGAGAAAC 60 
AGAAGGTGA6 AAAGTCTTT6 6TTCTGAA6C AGCTTCTAAG ATCTTTTCAT TTGCTTCATT 120 
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TCAAAGTTCC CATGCTGCCA AAGT6CCATC CTTTGGGGTA CTGTTTTCTG AGCTCCAGTG 180 

ATAACTCATT TATACAAGGG AGATACCCAG AAAAAAAGTG AGCAAATCTT AAAAAGGT6G 240 

CTTGAGTTCA GCCTTAAATA CCATCTTGAA ATGACACAGA GAAAGAANGA TGHGGGTGG 300 

GA6TGGATAG AGACCCTAAC G 321 
(2) INFORMATION FOR SEQ ID NO: 74: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 321- base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: Single 
(0) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74: 

GCACTGAGAG GAACTTCAGA GAGAGAGAGA GAGTTCCACC CTGTACTTGG GGAGAGAAAC 60 

AGAAGGTGAG AAAGTCTTTG GTTCTGAAGC AGCTTCTAAG ATCTTTTCAT TTGCTTCATT 120 

TCAAAGHCC CATGCTGCCA AAGTGCCATC CTTTGGGGTA CTGTTHCTG AGCTCCAGTG 180 

ATAACTCATT TATACAAGGG AGATACCCAG AAAAAAAGTG AGCAAATCTT AAAAAGGTG6 240 

CTTGAGTTCA GYCTTAAATA CCATCTTGAA ATGAMACA6A GAAAGAAGGA TGTTGGGTG6 300 

GAGTGGATAG AGACCCTAAC G 321 

(2) INFORMATION FOR SEQ 10 NO: 75: 

(1) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 317 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:75: 

GCACTGAGAG GAACTTCCAC ATGCACTGAG AAATGCATGT TCACAAGGAC TGAAGTCTGG 60 

AACTCAGTTT CTCAGTTCCA ATCCT6ATTC AGGTGTTTAC CAGCTACACA ACCTTAAGCA 120 

AGTCAGATAA CCTTAGCTTC CTCATATGCA AAATGAGAAT GAAAA6TACT CATCGCTGAA 180 

TTGTTTTGAG 6ATTAGAAAA ACATCTGGCA TGCAGTAGAA ATTCAATTAG TATTCATTTT 240 

CATTCTTCTA AATTAAACAA ATAGGATTTT TAGTGGTG6A ACTTCAGACA CCA6AAATGG 300 

GAGTGGATAG AGACCCT 317 
(2) INFORMATION FOR SEQ ID N0:76: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 244 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:76: 

CGTTAGGGTC TCTATCCACT CCCACTACTG ATCAAACTCT ATTTATTTAA TTATTTTTAT 60 

CATACHTAA GTTCTGGGAT ACACGTGCAG CATGCGCAGG TTTGTTGCAT AGGTATACAC 120 

TTGCCATGGT GGTTTGCTGC ACCCATCAGT CCATCATCTA CAHAGGTAT TTCTCCTAAT 180 
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GCTATCCCTC CCCTAGCCCC TTACACCCCC AACAGGCTCT AGTGTGTGAA GTTCCTCTCA 240 
GTGC 244 
(2) INFORMATION FOR SEQ ID N0:77: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 254 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 77: 

CGTTAGGGTC TCTATCCACT GAAATCTGAA GCACAGGAGG AA6AGAAGCA GTYCTAGTGA 60 

6ATGGCAAGT TCWTTTACCA CACTCTTTAA CATTTYGTTT AGTTTTAACC TTTATTTATG 120 

GATAATAAAG GTIAATATTA ATAATGATTT ATTTTAAGGC ATTCCCRAAT TTGCATAATT 180 

CTCCTTTTGG AGATACCCTT TTATCTCCAG TGCAAGTCTG GATCAAAGTG ATASAMAGAA 240 



GTTCCTCTCA GTGC 
(2) INFORMATION FOR SEQ ID N0:78: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 355 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID N0:78: 
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TTCGATACA6 GCAAACATGA ACTGCAGGAG GGTGGTGACG ATCATGAT6T TGCC6ATGGT 60 

CCGGATGGNC ACGAAGACGC ACT6GANCAC GTGCTTACGT CCTTTTGCTC TGTTGATGGC 120 

CCTGAGGGGA CGCAGGACCC TTATGACCCT CAGAATCTTC ACAACGGGAG ATGGCACTGG 180 

ATTGANTCCC ANTGACACCA GAGACACCCC AACCACCAGN ATATCANTAT ATTGATGTAG . 240 

TTCaGTAGA NGGCCCCCTT GTGGAGGAAA GCTCCATNAG TTG6TCATCT TCAACAQGAT 300 

CTCAACAGTT TCCGATGGCT GTGATGGGCA TAGTCATANT TAACCNTGTN TCGAA 355 
(2) INFORMATION FOR SEQ ID NO: 79: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 406 base pairs 

(B) TYPE: nucleic add 

(C) STRANDEDNESS: single 
(0) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:79: 

1AAGAGGGTA CCAGCAGAAA GGTTAGTATC ATCAGATAGC ATCHATACG AGTAATATGC 60 

CTGCTATTTG AAGTGTAATT GAGAAG6AAA ATTTTAGCGT GCTCACT6AC CTGCCTGTA6 120 

CCCCAGTGAC AGCTAGGATG TGCATTCTCC A6CCATCAAG AGACTGAGTC AAGTTGHCC 180 

TTAAGTCAGA ACAGCAGACT CAGCTCTGAC ATTCTGATTC GAATGACACT GTTCAGGAAT 240 

CGGAATCCTG TCGATTAGAC TGGACAGCH GTGGCAAGTG AATTTGCCTG TAACAAGCCA 300 

GATTTTTTAA AATTTATATT GTAAATAATG TGTGTGT6TG TGTGTGTATA TATATATATA 360 

TGTACAGTTA TCTAA6TTAA TTTAAAAGTT GTTTGGTACC CTCTTA 406 
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(2) INFORMATION FOR SEQ ID N0:80: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 327 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:80: 

TTTTTTTTTT TTTACTCGGC TCAGTCTAAT CCTTTTTGTA GTCACTCATA GGCCAGACTT 60 

AGGGCTA6GA T6ATGATTAA TAAGAGGGAT GACATAACTA TTAGTGGCAG GTTAGTTGTT 120 

TGTAGGGCTC ATGGTAGGGG TAAAAGGAGG GCAATTTCTA GATCAAATAA TAAGAAGGTA 180 

ATAGCTACTA AGAAGAATTT TATGGAGAAA GG6AC6CGGG CGGGGGATAT AGGGTCGAAG 240 

CCGCACTCGT AAGGGGTGGA TTTTTCTATG TAGCCGTTGA GTTGTGGTAG TCAAAATGTA 300 



ATAATTATTA GTAGTAAGCC TAGGAGA 
(2) INFORMATION FOR SEQ ID N0:81: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 318 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



327 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:81: 
TAGTCTATGC 6GTTGATTCG GCAATCCATT ATTTGCTGGA HTTGTCATG TGTTTTGCCA 



60 
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ATTGCATTCA TAATTTATTA TGCATTTATG CTTGTATCTC CTAAGTCATG GTATATAATC 120 

CATGCTTTTT ATGTmGTC TGACATAAAC TCTTATCAGA GCCCTTTGCA CACAGGGATT 180 

CAATAAATAT TAACACAGTC TACATTTATT TGGTGAATAT TGCATATCTG CTGTACTGAA 240 

AGCACATTAA GTAACAAAGG CAAGTGAGAA GAATGAAAAG CACTACTCAC AACAGTTATC 300 

ATGATTGCGC ATAGACTA 313 
(2) INFORMATION FOR SEO ID N0:82: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 338 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:82: 

TCTTCAACCT CTACTCCCAC TAATAGCTTT TTGATGACTT CTAGCAAGCC TCGCTAACCT 60 

CGCCTTACCC CCCACTATTA ACCTACTGGG AGAACTCTCT GTGCTAGTAA CCACGTTCTC 120 

CTGATCAAAT ATCACTCTCC TACTTACAGG ACTCAACATA QAGTCACAG CCCTATACTC 180 

CCTCTACATA TTTACCACAA CACAATGGGG CTCACTCACC CACCACATTA ACAACATAAA 240 

ACCCTCATTC ACACGAGAAA ACACCCTCAT GHCATACAC CTATCCCCCA TTCTCCTCCT 300 

ATCCCTCAAC CCCGACATCA TTACCGGGH TTCCTCH 333 
(2) INFORMATION FOR SEQ ID N0;83: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 111 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEONESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:83: 
AGCCATTTAC CACCCATCCA CAAAAAAAAA AAAAAAAAAG AAAAATATCA AGGAATAAAA 60 
ATAGACTTTG AACAAAAAGG AACATTTGCT GGCCTGAGGA GGCATCACCC G 111 
(2) INFORMATION FOR SEQ ID N0:84: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 224 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEONESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:84: 

TCGGGTGATG CCTCCTCAGG CCAAGAAGAT AAAGCTTCAG ACCCCTAACA CATTTCCAAA 60 

AAG6AA6AAA GGAGAAAAAA GGGCATCATC CCCGTTCCGA AGGGTCAGGG AGGAGGAAAT 120 

TGAGGT6GAT TCACGAGTTG CGGACAACTC CTTTGATGCC AAGCGAGGTG CAGCCGGAGA 180 

CTGGGGAGAG CGAGCCAATC AGGTTTTGAA GTTCCTCTCA GTGC 224 

(2) INFORMATION FOR SEQ ID N0:85: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 348 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION; SEO ID NO:85: 

GCACTGAGAG GAACHCGTT GGAAACGGGT THHTCATG TAAGGCTAGA CAGAAGAATT 60 

CTCAGTAACT TCCTTGTGTT GTGTGTATTC AACTCACASA GTTGAACGAT CCTTTACACA 120 

GAGCAGACTT GTAACACTCT TMTTGTGGAA THGCAAGTG GA6ATTTCAG SCGCHTGAA 180 

GTSAAAGGTA GAAAAGGAAA TATCTTCCTA TAAAAACTAG ACAGAATGAT TCTCAGAAAC 240 

TCCTTTGTGA TGTGT6CGTT CAACTCACAG AGTTTAACCT TTCWTTTCAT AGAAGCAGTT 300 

AGGAAACACT CTGTTTGTAA AGTCTGCAAG TGGATAGAGA CCCTAACG 348 
(2) INFORMATION FOR SEQ ID N0:86: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 293 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:86: 

GCACTGAGAG 6AACTTCYTT GTGWTGTKTG YATTCAACTC ACAGAGTTGA ASSWTSMHT 60 

ACABAGWKCA GGCTTKCAAA CACTCTTTTT GTMGAATYTG CAAGWGGAKA TTTSRRCCRC 120 

TTTGWGGYCW WYSKTMGAAW MGGRWATATC TTCWYATMRA AMCTAGACAG AAKSATTCTC 180 
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AKAAWSTYYY YTGTGAWGWS TGCRTTCAAC TCACAGAGKT KAACMWTYCT KYTSATRGAG 240 
CA6TTWKGAA ACTCTMTTTC TTTGGATTCT GCAAGTGGAT AGAGACCCTA ACG 293 
(2) INFORMATION FOR SEQ ID NO:87: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 87: 

CTCCTAGGCT 10 
(2) INFORMATION FOR SEQ ID N0:88: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(x1) SEQUENCE DESCRIPTION: SEQ ID N0:88: 
AGTAGTTGCC 10 
(2) INFORMATION FOR SEQ ID N0:89: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: U base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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(0) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:89: 
TTCCGTTATG C 

(2) INFORMATION FOR SEQ ID N0:90: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(x1) SEQUENCE DESCRIPTION: SEQ ID N0:90: 
T6GTAAAGGG 

(2) INFORMATION FOR SEQ ID N0:91: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:91: 
TCGGTCATAG 

(2) INFORMATION FOR SEQ ID N0:92: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:92: 
TACAACGAGG 

(2) INFORMATION FOR SEQ ID N0:93: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:93: 
TGGATTGGTC 

(2) INFORMATION FOR SEQ ID N0:94: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(XI) SEQUENCE DESCRIPTION: SEQ ID N0:94: 
CTTTCTACCC 
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(2) INFORMATION FOR SEQ ID N0:95: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:95: 
TTTTGGCTCC 10 
(2) INFORMATION FOR SEQ ID N0:96: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(0) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:96: 
GGAACCAATC 10 
(2) INFORMATION FOR SEQ ID N0:97: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:97: 
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TCGATACAGG 

(2) INFORMATION FOR SEQ ID N0:98: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEC ID N0:98: 
GGTACTAAGG 

(2) INFORMATION FOR SEQ ID N0:99: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(x1) SEQUENCE DESCRIPTION: SEQ ID N0:99: 
AGTCTATGCG 

(2) INFORMATION FOR SEQ ID NO: 100: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 100; 
CTATCCATGG 

(2) INFORMATION FOR SEQ ID NO: 101: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(0) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 101: 
TCTGTCCACA 

(2) INFORMATION FOR SEQ ID NO: 102: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE; nucleic acid 

(C) STRANDEDNESS: single 
(0) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:102: 
AAGAGGGTAC 

(2) INFORMATION FOR SEQ ID NO: 103: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 10 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 103: 
CTTCAACCTC 

(2) INFORMATION FOR SEQ ID NO: 104: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:104: 
GCTCCTCTTG CCTTACCAAC 
(2) INFORMATION FOR SEQ ID NO: 105: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:105: 
GTAAGTCGAG CAGT6TGATG 
(2) INFORMATION FOR SEQ ID NO: 106: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:106: 
GTAAGTCGAG CAGTCTGATG 
(2) INFORMATION FOR SEQ ID NO: 107: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 
CO STRANDEDNESS: single 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 107: 
GACTTAGTGG AAAGAATGTA 
(2) INFORMATION FOR SEQ ID NO: 108: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 108: 
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GTAATTCCGC CAACCGTAGT 

(2) INFORMATION FOR SEQ ID NO: 109: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 20 bdSG pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:109: 
ATGGTTGATC GATAGTGGAA 
(2) INFORMATION FOR SEQ ID NO: 110: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(0) TOPOLOGY: linear 



(x1) SEQUENCE DESCRIPTION: SEQ ID N0:110: 
ACGGGGACCC CTGCAHGAG 
(2) INFORMATION FOR SEQ ID NO: 111: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(XI) SEQUENCE DESCRIPTION: SEQ ID NO: 111: 
TATTCTAGAC CATTCGCTAC 20 
(2) INFORMATION FOR SEQ ID NO: 112: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEONESS: single 

(D) TOPOLOGY: linear 



(XT) SEQUENCE DESCRIPTION: SEQ ID NO: 112: 
ACATAACCAC TTTAGCGHC 20 
(2) INFORMATION FOR SEQ ID NO: 113: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEONESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 113: 
CGGGTGATGC CTCCTCAGGC 20 
(2) INFORMATION FOR SEQ ID NO: 114: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS: single 

(D) TOPOLOGY; linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 114: 
AGCATGTTGA GCCCAGACAC 
(2) INFORMATION FOR SEQ ID NO: 115: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 115: 
GACACCTTGT CCAGCATCT6 
(2) INFORMATION FOR SEQ ID NO: 116: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:116: 
TACGCTGCAA CACTGTGGAG 
(2) INFORMATION FOR SEQ ID N0:117: 
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(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:117: 
C6TTAGG6TC TCTATCCACT 
(2) INFORMATION FOR SEQ ID NO: 118: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 118: 
AGACTGACTC ATGTCCCCTA 
(2) INFORMATION FOR SEQ 10 NO: 119: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
• (D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 119: 



TCATCGCTCG GTGACTCAAG 



20 
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(2) INFORMATION FOR SEQ ID NO: 120: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:120: 
CAAGATTCCA TA6GCTGACC 
(2) INFORMATION FOR SEQ ID NO: 121: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic add 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 121: 
ACGTACTG6T CTTGAAGGTC 
(2) INFORMATION FOR SEQ ID NO: 122: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(x1) SEQUENCE DESCRIPTION: SEQ ID NO: 122: 
GAC6CTT6GC CACTTGACAC 
(2) INFORMATION FOR SEQ ID NO: 123: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:123: 
GTATCGACGT AGTGGTCTCC 
(2) INFORMATION FOR SEQ ID N0:124: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 124: 
TAGTGACATT ACGACGCTGG 
(2) INFORMATION FOR SEQ ID NO: 125: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:125: 
CGG6TGATGC CTCCTCA6GC 
(2) INFORMATION FOR SEQ ID NO: 126: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 126: 
ATGGCTATTT TCGGGGGCTG ACA 
(2) INFORMATION FOR SEQ ID NO: 127: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:127: 
CCGGTATCTC CTCGTGGGTA TT 
(2) INFORMATION FOR SEQ ID NO: 128: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 18 base pairs 

(B) TYPE; nucleic acid 

(C) STRANDEDNESS: single 
(0) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:128: 
CTGCCTGAGC CACAAATG 
(2) INFORMATION FOR SEQ ID N0:129: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:129: 
CCGGAGGAGG AAGCTAGAGG AATA 
(2) INFORMATION FOR SEQ ID NO; 130: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPaOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:130: 



THTTHHT TTAG 



wo 97/25426 



PCTAJS97/00485 



114 

(2) INFORMATION FOR SEQ ID NO: 131: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 131: 

Ser Scr Gly Gly Arg Thr Phe Asp Asp Phe His Arg Tyr Leu Leu Val 
15 10 15 

Gly He 



(2) INFORMATION FOR SEQ ID NO: 132: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:132: 

Gin Gly Ala Ala Gin Lys Pro He Asn Leu Ser Lys Xaa He Glu Val 
15 10 15 

Val Gin Gly His Asp Glu 
20 

(2) INFORMATION FOR SEQ ID NO: 133: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:133: 

Ser Pro Gly Val Phe Leu Glu His Leu Gin Glu Ala Tyr Arg He Tyr 
15 10 15 

Thr Pro Phe Asp Leu Ser Ala 
20 

(2) INFORMATION FOR SEQ ID NO: 134: 

Ci) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:134: 

Tyr Leu Leu Val Gly lie Gin Gly Ala 
1 5 

(2) INFORMATION FOR SEQ ID NO: 135: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(x1) SEQUENCE DESCRIPTION: SEQ ID N0:135: 

Gly Ala Ala Gin Lys Pro He Asn Leu 
1 5 

(2) INFORMATION FOR SEQ ID NO: 136: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 136: 

Asn Leu Sen Lys Xaa He Glu Val Val 
1 5 

(2) INFORMATION FOR SEQ ID NO: 13V; 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(XI) SEQUENCE DESCRIPTION: SEQ ID NO: 137: 

Glu Val Val Gin Gly His Asp Glu Ser 
1 5 

(2) INFORMATION FOR SEQ ID NO: 138: 
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(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 138: 

His Leu Gin Glu Ala Tyr Arg He Tyr 
1 5 

(2) INFORMATION FOR SEQ ID NO: 139: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:139: 

Asn Leu Ala Phe Val Ala Gin Ala Ala 
1 5 

(2) INFORMATION FOR SEQ ID NO: 140: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID N0:140: 

Phe Val Ala Gin Ala Ala Pro Asp Sen 
1 5 

(2) INFORMATION FOR SEQ ID NO: 141: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9388 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(0) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ 10 NO: 141: 



GCTCGCGGCC 6CGAGCTCAA TTAACCCTCA CTAAA6GGAG TCGACTCGAT CAGACTGTTA 60 



CT6TGTCTAT GTAGAAAGAA GTAGACATAA GAGATTCCAT TTTGTTCTGT ACTAAGAAAA 120 



ATTCTTCTGC CTTGAGATGC TGTTAATCTG TAACCCTAGC CCCAACCCTG TGCTCACAGA 180 



GACATGTGCT GTGTTGACTC AAGGTTCAAT GGATTTAGGG CTATGCTHG TTAAAAAAGT 240 



GCTT6AAGAT AATATGCTTG TTAAAAGTCA TCACCATTCT CTAATCTCAA GTACCCAGGG 300 



ACACAATACA CTGCGGAAGG CCGCAGGGAC CTCTGTCTAG GAAAGCCAGG TATTGTCCAA 360 



GATTTCTCCC CATGTGATAG CCTGAGATAT GGCCTCATGG GAAG6GTAAG ACCTGACTGT 420 



CCCCCAGCCC GACATCCCCC AGCCCGACAT CCCCCAGCCC GACACCCGAA AAGGGTCTGT 480 



GCTGAGGAGG ATTAGTAAAA GAGGAAGGCC TCHTGCAGT TGAGGTAAGA GGAAGGCATC 540 



TGTCTCCTGC TCGTCCCTGG GCAATAGAAT GTCHGGTGT AAAACCCGAT TGTATGHCT 600 
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ACTTACTGAG ATAGGAGAAA ACATCCTTAG GGCTGGAGGT GAGACACGCT GGCGGCAATA 660 

CTGCTCTTTA ATGCACCGAG ATGTTTGTAT AAGTGCACAT CAAGGCACAG CACCTHCCT 720 

TAAACTTATT TATGACACAG AGACCTHGT TCACGTTTTC aGCTGACCC TCTCCCCACT 780 

ATTACCCTAT TGGCaGCCA CATCCCCCTC TCCGAGATGG TAGAGATAAT GATCAATAAA 840 

TACTGAGGGA ACTCA6AGAC CA6TGTCCCT GTAGGTCCTC CGTGTGCTGA GCGCCGGTCC 900 

CTTGGGCTCA CnTTCTnC TCTATACTTT GTCTCTGTGT CTCTTTCTTT TCTCAGTCTC 960 

TC6TTCCACC TGAC6AGAAA TACCCACAGG TGTGGAGGGG CAGGCCACCC CTTCAATAAT 1020 

TTACTAGCCT GTTCGCTGAC AACAAGACTG GTGGTGCAGA AGGTTGGGTC TTGGTGTTCA 1080 

CCGGGTGGCA GGCATGGGCC AGGTGGGAGG GTCTCCAGCG CCTGGTGCAA ATCTCCAAGA 1140 

AAGTGCAGGA AACAGCACCA AGGGTGATT6 TAAATHTGA TTTGGC6CGG CAGGTAGCCA 1200 

TTCCA6CGCA AAAAT6CGCA 6GAAAGCTTT TGCTGTGCTT GTAG6CAGGT AGGCCCCAAG 1260 

CACTTCTTAT TGGCTAATGT GGAGGGAACC TGCACATCCA TTGGCTGAAA TCTCCGTCTA 1320 

TTTGAGGCTG ACTGAGCGC6 TTCCTTTCTT CTGTGTTGCC TGGAAACGGA CTGTCTGCCT 1380 

AGTAACATCT GATCACGTn CCCATTG6CC GCCGITTCCG 6AAGCCCGCC CTCCCATTTC 1440 

CGGAAGCCTG GCGCAAGGTT GGTCTGCAGG TGGCCTCCAG GTGCAAAGTG GGAAGTGTGA 1500 

GTCCTCAGTC TIGGGQATT CGGCCACGTG CCTGCCGGAC ATGGGACGCT GGAGGGTCAG 1560 

CAGCGTG6AG TCCTG6CCTT TTGCGTCCAC GG6TGGGAAA TIGGCCAHG CCACGGCGGG 1620 

AACTGGGACT CAGGCTGCCC CCCGGCCGH TCTCATCCGT CCACCGGACT CGTGGGCGCT 1680 
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CGCACIGGCG CTGATGTAGT HCCTGACCT CTGACCCGTA HGTCTCCAG ATTAAAGGTA 1740 

AAAACGGGGC TTTTTCAGCC CACTCGGGTA AAACGCCTTT TGAHTCTAG GCAGGTGTTT 1800 

TGTTGCAC6C CTGGGAGGGA 6TGACCCGCA GGTTGAGGTT TATTAAAATA CATTCCTG6T 1860 

TTATGTTATG TTTATAATAA AGCACCCCAA CCTTTACAAA ATCTCACHT TTGCCAGHG 1920 

TATTATTTAG TGGACTGTCT CTGATAAGGA CAGCCAGTTA AAATGGAATT TTGTTGTTGC 1980 

TAATTAAACC AATTTTTAGT TTTGGTGm 6TCCTAATAG CAACAACHC TCAGGCHTA 2040 

TAAAACCATA TTTCTTGGGG GAAATTTCTG TGTAAGGCAC AGCGAGHAG TTTGGAATTG 2100 

TTTTAAAGGA AGTAA6TTCC TG6TTTTGAT ATCTTAGTAG T6TAATGCCC AACCTGGTTT 2160 

TTACTAACCC TGTTTTTAGA CTCTCCCTTT CCTTAAATCA CCTAGCCTTG TTTCCACCTG 2220 

AATTGACTCT CCCTTAGCTA AGAGCGCCAG ATGGACTCCA TCTTGGCTCT TTCACTGGCA 2280 

GCCCCTTCCT CAAGGACTTA ACTTGTGCAA GCTGACTCCC A6CACATCCA AGAATGCAAT 2340 

TAACTGTTAA GATACTGTGG CAAGCTATAT CCGCAGTTCC GAGGAAHCA TCCGATTGAT 2400 

TATGCCCAAA AGCCCCGCGT CTATCACCTT GTAATAATCT TAAAGCCCCT GCACCTGGAA 2460 

CTATTAACTT TCCTGTAACC ATTTATCCTT TTAACTTTTT TGCHACTTT ATTTCT6TAA 2520 

AATTGHTTA ACTA6ACCTC CCCTCCCCTT TCTAAACCAA AGTATAAAAG AAGATCTAGC 2580 

CCCTTCTTCA GAGCGGAGAG AATTTTGAGC ATTAGCCATC TCTTGGCGGC CAGCTAAATA 2640 

AATGGACTTT TAAniGTCT CAAAGTGTGG CGTinCTCT AACTCGCTCA GGTACGACAT 2700 

TT6GAGGCCC CAGCGAGAAA CGTCACCGGG AGAAAC6TCA CCGGGCGAGA GCCGGGCCCG 2760 
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CTGTGTGCTC CCCCGGAAGG ACAGCCAGCT TGTAGGGGGG AGTGCCACCT GAAAAAAAAA 2820 

THCCAGGTC CCCAAAGGGT GACCGTCHC CGGAGGACAG C6GATCGACT ACCATGCGGG 2880 

TGCCCACCAA AATTCCACCT CTGAGTCCTC AACTGCTGAC CCCGGGGTCA GGTAGGTCAG 2940 

AinGACHT GGTTCTGGCA GAGGGAAGCG ACCCTGATGA GGGTGTCCCT CTTHGACTC 3000 

TGCCCAHTC TCTAG6ATGC TAGAGGGTAG AGCCCTGGTT nCTGTTAGA CGCCTCTGTG 3060 

TCTCTGTCTG G6AGGGAAGT GGCCCTGACA GG6GCCATCC CTTGAGTCAG TCCACATCCC 3120 

AGGATGCTGG GGGACTGAGT CCTGGTTTCT GGCAGACTGG TCTCTCTCTC TCTCTTTTTC 3180 

TATCTCTAAT CTTTCCTTGT TCAGGTTTCT TG6AGAATCT CTGGGAAAGA AAAAAGAAAA 3240 

ACTGTTATAA ACTCTGTGTG AATGGTGAAT GAATGGGGGA GGACAAGGGC TTGCGCTTGT 3300 

CCTCCAGHT GTAGCTCCAC GGCGAAAGCT ACGGAGnCA AGTGGGCCCT CACCTGCGGT 3360 

TCCGTGGCGA CCTCATAAGG CTTAAGGCAG CATCCGGCAT AGCTCGATCC GAGCCGGGGG 3420 

TTTATACCGG CCTGTCAATG CTAAGAGGAG CCCAAGTCCC CTAAGG6GGA GCGGCCAGGC 3480 

GGGCATCTGA CTGATCCCAT CACGGGACCC CCTCCCCTTG TTTGTCTAAA AAAAAAAAAA 3540 

GAAGAAACTG TCATAACTGT HACATGCCC TAGGGTCAAC TGTTTGTTTT ATGITTATTG 3600 

TTCTGHCGG T6TCTATTGT CTTGTTTAGT GGHGTCAAG GiniGCATG TCAGGACGTC 3660 

GATATTGCCC AAGACGTCTG GGTAAGAACT TCTGCAAGGT CCTTAGTGCT GATTTTTTGT 3720 

CACAGGAGGT TAAATHCTC ATCAATCATT TAGGCTGGCC ACCACAGTCC TGTCTTTTCT 3780 

6CCAGAAGCA AGTCAGGT6T TGTTACGGGA ATGAGTGTAA AAAAACATTC GCCTGATTGG 3840 
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GATnCTGGC ACCATGATGG TT6TATTTAG ATTGTCATAC CCCACATCCA GGTTGAHGG 3900 

ACCTCCTCTA AACTAAACTG GTGGTGGGTT CAAAACAGCC ACCCTGCA6A HTCCTTGCT 3960 

CACCTCTTTG GTCATTCTGT AACTTTTCCT GTGCCCnAA ATAGCACACT 6TGTAGGGAA 4020 

ACCTACCCTC GTACTGCTTT ACTTCGTTTA GATTCTTACT CTGTTCCTCT GTGGCTACTC 4080 

TCCCATCTTA AAAACGATCC AAGTGGTCCT TTTCCTCCTC CCTGCCCCCT ACCCCACACA 4140 

TCTCGTTTTC CAGTGCGACA GCAAGTTCAG C6TCTCCAGG ACTTG6CTCT GCTCTCACTC 4200 

CTTGAACCCT TAAAAGAAAA AGCTGGGTH GAGCTATTTG CCTTTGAGTC ATGGAGACAC 4260 

AAAA6GTATT TAG6GTACAG ATCTA6AAGA A6AGAGAGAA CACCTAGATC CAACTGACCC 4320 

AGGAGATCTC GGGCTGGCCT CTAGTCCTCC TCCCTCAATC TTAAAGCTAC AGTGATGTGG 4380 

CAAGT6GTAT TTAGCTGTTG TGGTTTTTCT GCTCHTCTG GTCATGTTGA nCTGTTCTT 4440 

TCGATACTCC AGCCCCCCAG 6GAGTGAGTT TCTCTGTCTG TGCTGGGTTT GATATCTATG 4500 

TTCAAATCTT ATTAAATTGC CTTCAAAAAA AAAAAAAAAA GGGAAACACT TCCTCCCAGC 4560 

CTT6TAAGGG TTGGAGCCCT CTCCAGTATA TGCTGCAGAA TTTTTCTCTC GGTTTCTCAG 4620 

A6GATTAT6G AGTCCGCCTT AAAAAAGGCA AGCTCTGGAC ACTCTGCAAA 6TAGAATGGC 4680 

CAAAGTTTGG AGTTGAGTGG CCCCTTGAAG GGTCACTGAA CCTCACAAH GTTCAAGCTG 4740 

TGTGGCGGGT TGTTACTGAA ACTCCCGGCC TCCCTGATCA GTTTCCCTAC ATTGATCAAT 4800 

GGCTGAGTTT G6TCAG6AGC ACCCCTTCCA TGGCTCCACT CATGCACCAT TCATAATTTT 4860 

ACCTCCAAGG TCCTCCTGAG CCAGACCGTG TTTTCGCCTC GACCCTCAGC CGGTTCAGCT 4920 
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CGCCCTGTAC TGCCTCTCTC TGAAGAAGAG GAGAGTCTCC CTCACCCAGT CCCACCGCCT 4980 

TAAAACCAGC CTACTCCCTT AGGGTCATCC CATGTCTCCT CGGCTATGTC CCCTGTAGGC 5040 

TCATCACCCA TTGCCTCTTG GTTGCAACCG TGGTGGGAGG AAGTAGCCCC TCTACTACCA 5100 

CTGAGAGAGG CACAAGTCCC TCTGGGTGAT GAGTGCTCCA CCCCCTTCCT GGTTTATGTC 5160 

CCTTCTTTCT ACTTCTGACT TGTATAATT6 GAAAACCCAT AATCCTCCCT TCTCTGAAAA 5220 

GCCCCAGGCT TTGACCTCAC TGATGGAGTC TGTACTCTGG ACACATTGGC CCACCTGGGA 5280 

TGACTGTCAA CAGCTCCTTT TGACCCTTTT CACCTCTGAA 6AGAGGGAAA GTATCCAAAG 5340 

AGAG6CCAAA AAGTACAACC TCACATCAAC CAATAGGCCG 6AG6AGGAAG CTAGAGGAAT 5400 

AGTGATTAGA GACCCAAHG GGACCTAAH GGGACCCAAA THCTCAAGT GGAQGGAGAA 5460 

CTTTTGACGA THCCACCGG TATCTCCTCG TGGGTATTCA GG6AGCTGCT CAGAAACCTA 5520 

TAAACTTGTC TAAGGCGACT GAAGTCGTCC AGGGGCATGA TGAGTCACCA GGAGTCHTT 5580 

TAGAGCACCT CCAGGAGGCT TATCGGATTT ACACCCCTTT TGACCTGGCA GCCCCCGAAA 5640 

ATAGCCATGC TCTTAATTrG GCATTTGTGG CTCAGGCAGC CCCAGATAGT AAAAGGAAAC 5700 

TCCAAAAACT AGAGGGATTT TGCT6GAATG AATACCAGTC AGCTTTTAGA GATAGCCTAA 5760 

AAGGTTTnG ACAGTCAAGA GGTTGAAAAA CAAAAACAAG CAGCTCAQGC AGCTGAAAAA 5820 

AGCCACTGAT AAAGCATCCT G6AGTATCAG AGTHACTGI TAGATCAGCC TCATTTGACT 5880 

TCCCCTCCCA CATG6TGTTT AAATCCAGCT ACACTACnC CTGACTCAAA CTCCACTATT 5940 

CCTGTTCATG ACTGTCA6GA ACTGTTGGAA ACTACTGAAA CTG6CCGACC TGATCTTCAA 6000 
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AATGTGCCCC TAGGAAAGGT GGATGCCACC GTGTTCACAG ACAGTA6CAG CTTCCTCGAG 6060 

AAGGGACTAC GAAAG6CCGG TGCAGCTGTT ACCATG6AGA CAGATGTGTT GTGGGCTCAG 6120 

GCTTTACCAG CAAACACCTC AGCACAAAAG GCTGAATTGA TCGCCCTCAC TCAGGCTCTC 6180 

CGATGGGGTA AGGATAHAA CGTTAACACT GACAGCAGGT ACGCCTHGC TACTGTGCAT 6240 

GTACGTGGAG CCATCTACCA GGAGCGTGGG CTACTCACCT CAGCAGGTGG CTGTAATCCA 6300 

CTGTAAAGGA CATCAAAAGG AAAACACGGC TGTTGCCCGT GGTAACCAGA AAGCTGATTC 6360 

AGCAGCTCAA GATGCA6TGT 6ACTTTCAGT CACGCCTCTA AACHGCTGC CCACAGTCTC 6420 

CTTTCCACA6 CCAGATCTGC CTGACAATCC CGCATACTCA ACAGAAGAAG AAAACTGGCC 6480 

TCAGAACTCA GAGCCAATAA AAATCAGGAA GGTTGGTGGA TTCTTCCTGA CTCTAGAATC 6540 

TTCATACCCC GAACTCTTGG GAAAACTTTA ATCAGTCACC TACAGTCTAC CACCCAnTA 6600 

6GAGGA6CAA AGCTACCTCA GCTCCTCCQG AGCCGTTTTA AGATCCCCCA TCTTCAAAGC 6660 

CTAACAGATC AAGCAGCTCT CCGGTGCACA ACCTGCGCCC AGGTAAATGC CAAAAAAGGT 6720 

CCTAAACCCA 6CCCA6GCCA CCGTCTCCAA GAAAACTCAC CAG6AGAAAA GTGGGAAATT 6780 

GACTTTACAG AA6TAAAACC ACACCGGGCT GGGTACAAAT ACCHCTAGT ACTGGTAGAC 6840 

ACCTTCTCTG GATGGACTGA AGCATTTGCT ACCAAAAACG AAACTGTCAA TAT6GTAGTT 6900 

AAGTTTTTAC TCAATGAAAT CATCCCTCGA CGTGGGCTGC CT6TTGCCAT AGGGTCT6AT 6960 

AATGGACCGG CCTTCGCCTT GTCTATAGTT TAGTCAGTCA GTAAGGCGH AAACATTCAA 7020 

T6GAAGCTCC ATTGTGCCTA TCGACCCCAG AGCTCTGGGC AAGTAGAACG CATGAACTGC 7080 
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ACCCTAAAAA ACACTCHAC AAAATTAATC TTAGAAACCG GTGTAAATTG TGTAAGTCTC 7140 

CTTCCTTTAG CCCTACTTAG AGTAAGGTGC ACCCCTTACT GG6CTGGGTT CTTACCTTTT 7200 

GAAATCATGT ATGGGAGGGC GCTGCCTATC TTGCCTAAGC TAAGAGATGC CCAATTGGCA 7260 

AAAATATCAC AAACTAATTT ATTACAGTAC CTACAGTCTC CCCAACAGGT ACAAGATATC 7320 

ATCCTGCCAC TTGTTCGAGG AACCCATCCC AATCCAATTC CTGAACAGAC AGGGCCCTGC 7380 

CATTCATTCC CGCCAGGTGA CCTGTTGTTT GTTAAAAAGT TCCAGAGAGA AGGACTCCCT 7440 

CCTGCTT6GA AGAGACCTCA CACCGTCATC ACGATGCCAA CGGCTCTGAA GGTGGAT6GC 7500 

ATTCCTGCGT GGAHCATCA CTCCCGCATC AAAAAGGCCA ACGGAGCCCA ACTAGAAACA 7560 

TGG6TCCCCA GGGCTGGGTC AG6CCCCTTA AAACTGCACC TAAGTTGGGT GAAGCCATTA 7620 

GATTAATTCT TTTTCTTAAT TTTGTAAAAC AATGCATAGC nCTGTCAAA CTTATGTATC 7680 

TTAAGACTCA ATATAACCCC CTTGTTATAA CTGAG6AATC AATGATTT6A TTCCCCAAAA 7740 

ACACAAGTGG GGAATGTAGT GTCCAACCTG GITHTACTA ACCCTGTTTT TAGACTCTCC 7800 

CTTTCCTTTA ATCACTCAGC CTTGTTTCCA CCTGAATTGA CTCTCCCTTA 6CTAAGAGCG 7860 

CCAGATGGAC TCCATCTTGG CTCTTTCACT GGCAGCCGCT TCCTCAAGGA CnAACTTGT 7920 

GCAAGCTGAC TCCCAGCACA TCCAAGAATG CAAHAACTG ATAAGATACT GTGGCAAGCT 7980 

ATATCCGCA6 TTCCCAGGAA TTCGTCCAAT TGATTACACC CAAAAGCCCC GCGTCTATCA 8040 

CCTTGTAATA ATCTTAAAGC CCCTGCACCT GGAACTATTA ACGTTCCTGT AACCATHAT 8100 

CCTTTTAACT TTTHGCCTA CTTTATTTCT GTAAAATTGT TTTAACTAGA CCCCCCCTCT 8160 
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CCTTTCTAAA CCAAAGTATA AAAGCAAATC TAGCCCCTTC TTCAGGCCGA GAGAATTTCG 8220 

A6C6TTAGCC GTCTCTTG6C CACCAGCTAA ATAAACGGAT TCTTCATGT6 TCTCAAAGT6 8280 

TGGC6TTTTC TCTAACTCGC TCAGGTACGA CCGTGGTAGT ATTTTCCCCA ACGTCTTATT 8340 

THAGGGCAC GTATGTAGAG TAACTTTTAT GAAAGAAACC AGTTAAGGAG GTTTTGGGAT 8400 

TTCCTTTATC AACTGTAATA CTG6TTTTGA TTATTTATTT ATTTAHTAT TTTTHTGAG 8460 

AAGGAGTTTC ACTCTTGHG CCCAGGCTGG AGTGCAATGG TGCGATCTTG GCTCACTGCA 8520 

ACTTCCGCCT CCCAGGHCA AGCGATTCTC CTGCCTCAGC CTCGAGAGTA GCTGGGATTA 8580 

TAGGCATGCG CCACCACACC CAGCTAATTT TGTATtTTTA GTAAAGATGG GGTTTCTTCA 8640 

TGTTGGTCAA GCT6GTCTGG AACTCCCC6C CTCGGGTGAT CTGCCCGCCT CGGCCTCCGA 8700 

AAGTGCTGGG ATTACAG6T6 T6ATCCACCA CACCCAGCC6 ATTTATATGT ATATAAATCA 8760 

CATTCCTCTA ACCAAAATGT AGIGHTCCT TCCATCTTGA ATATAGGCTG TAGACCCCGT 8820 

G6GTATGGGA CATTGTTAAC AGT6AGACCA CAGCAGTTTT TATGTGATCT GACA6CATCT 8880 

CCAAATAGCC TTCATGGTTG TCACTGCTTC CCAAGACAAT TCCAAATAAC ACTTCCCAGT 8940 

GATGACTTGC TACTT6CTAT TGTTACTTAA TGTGTTAAGG TGGCT6TTAC AGACACTATT 9000 

AGTAT6TCAG 6AATTACACC AAAATTTAGT GGCTCAAACA ATCATTTTAT TATGTATGTG 9060 
GATTCTCATG GTCAGGTCAG GATTTCA6AC AGGGCACAAG GGTAGCCCAC TTGTCTCTGT 9120 
CTATGATGTC T6GCCTCAGC ACAGGAGACT CAACAGCTG6 GGTCTGGGAC CAniGGAGG 9180 
CTTGTTCCCT CACATCTGAT ACCTGGCTTG G6ATGTTGGA AGAGGGGGTG AGCTGA6ACT 9240 
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GAGTGCCTAT ATGTAGTGTT TCCATATGGC CTTGACTTCC TTACAGCCTG GCA6CCTCAG 9300 
GGTAGTCAGA ATTCTTAGGA GGCACAGGGC TCCAGGGCAG ATGCTGAGGG GTCTHTATG 9360 
AGGTAGCACA GCAAATCCAC CCAG6ATC 9388 
(2) INFORMATION FOR SEQ ID NO: 142: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 419 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEO ID N0:142: 

T6TAAGTCGA GCAGTGTGAT GGAAG6AATG GTCTTTG6AG AGAGCATATC CATCTCCTCC 60 

TCACTGCCTC CTAATGTCAT GAGGTACACT GAGCAGAATT AAACAGGGTA GTCTTAACCA 120 

CACTATTTTT AGCTACCTTG TCAAGCTAAT G6TTAAAGAA CACTTTTGGT TTACACTTGT 180 

TGGGTCATAG AAGTTGCTTT CCGCCATCAC GCAATAA6TT TGTGTGTAAT CAGAAGGAGT 240 

TACCTTATGG HTCAGTGTC ATTCTTTAGT TAACHGGGA GCTGTGTAAT TTAGGCTTTG 300 

CGTATTATH CACHCTGTT CTCCACHAT GAAGTGATTG TGTGTTCGCG TGTGTGTGCG 360 

TGCGCATGTG CTTCCGGCAG TTAACATAAG CAAATACCCA ACATCACACT GCTCGACTT 419 
(2) INFORMATION FOR SEQ ID NO: 143: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 402 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEO ID N0:143: 

TGTAAGTCGA 6CAGTGTGAT GTCCACTGCA GTGTGTTGCT GGGAACAGTT AATGAGCAAA 60 

TTGTATACAA TGGCTAGTAC ATT6ACCGGG ATTTGTTGAA GCTGGTGAGT GTTATGACTT 120 

AGCCTGTTAG ACTAGTCTAT GCACATGGCT CTGGTCAACT ACCGCTCTCT CATTTCTCCA 180 

GATAAATCCC CCATGCTTTA TATTCTCTTC CAAACATACT ATCCTCATCA CCACATAGTT 240 

CCTTTGTTAA TGCTTTGTTC TAGACTTTCC CTTTTCTGTT TTCTTATTCA AACCTATATC 300 

TCTTTGCATA GATTGTAAAT TCAAATGCCC TCA6GGTGCA GGCAGTTCAT GTAAGGGAGG 360 

GAGGCTAGCC AGTGAGATCT GCATCACACT GCTCGACTTA CA 402 
(2) INFORMATION FOR SEQ ID NO: 144: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 224 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:144: 

TCG6GTGATG CCTCCTCAGG CCAAGAAGAT AAAGCHCAG ACCCCTAACA CATTTCCAAA 60 

AAGGAAGAAA 66AGAAAAAA GGGCATCATC CCCGTTCCGA AGGGTCAGGG AGGAGGAAAT 120 

IGAGGTGGAT TCACGAGTTG CG6ACAACTC CTTTGATGCC AAGCGAGGTG CAGCCG6AGA 180 
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CTG6GGAGAG CGAGCCAATC AGGTTTTGAA GTTCCTCTCA GTGC 224 
(2) INFORMATION FOR SEQ ID NO: 145: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 111 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(0) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 145: 
AGCCATTTAC CACCCATCCA CAAAAAAAAA AAAAAAAAAG AAAAATATCA AGGAATAAAA 60 
ATAGACTTTG AACAAAAAGG AACATTTGCT GGCCTGAGGA GGCATCACCC G HI 
(2) INFORMATION FOR SEQ ID NO: 146: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 585 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:146: 

TAGCATGnG AGCCCAGACA CTTGTAGAGA GAGGAGGACA GTTAGAAGAA GAAGAAAAGT 60 

TTTTAAATGC TGAAAGTTAC TATAAGAAA6 CTTTGGCTTT GGATGA6ACT TTTAAA6ATG 120 

CAGAGGATGC TTT6CAGAAA CTTCATAAAT ATATGCAGGT GATTCCTTAT TTCCTCCTAG 180 

AAATTTAGTG ATAHTGAAA TAATGCCCAA ACTTAATTTT CTCCTGAGGA AAACTATTCT 240 
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ACATTACTTA AGTAAGGCAT TATGAAAAGT TTCTTTTTAG GTATAGTTTT TCCTAATTGG 300 

GTTTGACATT GCTTCATAGT GCCTCTGTTT TTGTCCATAA TC6AAAGTAA AGATAGCTGT 360 

GAGAAAACTA TTACCTAAAT TTGGTATGTT GHTTGAGAA ATGTCCTTAT AGGGA6CTCA 420 

CCTGGTGCn TTTAAATTAT TGTT6CTACT ATAATTGAGC TAATTATAAA AACCTTTTTG 480 

AGACATAin TAAATTGTCT HTCCTGIAA TACTGATGAT GATGTTTTCT CATGCATTTT 540 

CTTCTGAATT GGGACCATTG CTGCTGTGTC TGGGCTCACA T6CTA 585 
(2) INFORMATION FOR SEQ ID NO: 147: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 579 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY; linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:147: 

TAGCATGTTG AGCCCAGACA CTGGGCAGCG GGGGTGGCCA CGGCAGCTCC TGCCGAGCCC 60 

AAGCGTGTTT GTCT6TGAAG GACCCTGACG TCACCTGCCA GGCTAGGGAG GGGTCAATGT 120 

GGAGTGAATG TTCACCGACT TTCGCAGGAG TGTGCAGAAG CCAGGTGCAA CTTGGnTGC 180 

TTGTGTTCAT CACCCCTCAA GATATGCACA CTGCTTTCCA AATAAAGCAT CAACTGTCAT 240 

CTCCAGATGG GGAAGACHT TTCTCCAACC AGCAGGCAGG TCCCCATCCA CTCA6ACACC 300 

AGCACGTCCA CCTTCTCGGG CAGCACCAC6 TCCTCCACCT TCTGCTGGTA CACG6TGATG 360 
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ATGTCAGCAA AGCCGTTCTG CANGACCAGC TGCCCCGTGT GCTGTGCCAT CTCACTGGCC 420 

TCCACCGCGT ACACCGCTCT AGGCCGCGCA TANTGTGCAC AGAANAAATG ATGATCCAGT 480 

CCCACAGCCC ACGTCCAAGA NGACHTATC CGTCAGGGAT TCTTTATTCT GCAGGAT6AC 540 

CTGTGGTATT AATTGHCGT GTCTGGGCTC AACATGCTA 579 
(2) INFORMATION FOR SEQ ID NO: 148: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 249 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 148: 

TGACACCTTG TCCAGCATCT GCAAGCCAGG AAGAGAGTCC TCACCAAGAT CCCCACCCCG 60 

HGGCACCAG GATCTTGGAC TTCCAATCTC CAGAACTGTG A6AAATAAGT ATTTGTCGCT 120 

AAATAAATCT TTGTGGTTTC AGATATTTAG CTATAGCA6A TCAGGCTGAC TAAGAGAAAC 180 

CCCATAAGAG TTACATACTC ATTAATCTCC GTCTCTATCC CCAGGTCTCA GATGCT6GAC 240 

AA6GTGTCA 249 

(2) INFORMATION FOR SEQ ID N0:149: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 255 base pairs 
(8) TYPE: nucleic acid 
(C) STRANDEDNESS: single 
(0) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID N0:149: 

TGACACCTTG TCCAGCATCT 6CTATTTTGT GACTTJTTAA TAATAGCCAT TCTGACT6GT 60 

GTGAGATGGT AACTCATTGT GGGTTTGGTC TGCATTTCTC TAATGATCAG TGATAHAAG 120 

CTTTTTTTAA ATATGCTTGT TGACCACATG TATATCATCT TTTGAGAAGT GTCTGTTCAT 180 

ATCCTTTGCC CACTTTTTAA TTHTTTATC TTGTAAATTT GTTTAATTTC CTTACAGATG 240 

CTGGACAAGG TGTCA 255 
(2) INFORMATION FOR SEQ ID NO; 150: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 318 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(XI) SEQUENCE DESCRIPTION: SEQ ID N0:150: 

TTACGCTGCA ACACTGTGGA 6GCCAAGCTG GGATCACTTC TTCAHCTAA CTGGAGA6GA 60 

GGGAAGTTCA AGTCCAGCAG AGGGTGGGTG GGTAGACAGT GGCACTCAGA AATGTCAGCT 120 

GGACCCCTGT CCCCGCATAG GCAGGACAGC AAGGCTGTGG CTCTCCAGGG CCAGCTGAAG 180 

AACAG6ACAC TGTCTCCGCT GCCACAAAGC GTCAGAGACT CCCATCHTG AAGCACGGCC 240 

nCTTGGTCT TCCTGCACTT CCCTGTTCTG TTAGAGACCT GGTTATAGAC AAGGCTTCTC 300 

CACAGTGTTG CAGC6TAA 318 
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(2) INFORMATION FOR SEO ID NO; 151: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 323 base pairs 

(B) TYPE: aicleic acid 

(C) STRANDEONESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:151: 

TNACGCNGCN ACNNTGTAGA GANGGNAAGG CNTTCCCCAC ATTNCCCCTT CATNANAGAA 60 

TTATTCNACC AAGNNTGACC NATGCCNTTT AT6ACTTACA TGCNNACTNC NTAATCT6TN 120 

TCNNGCCTTA AAAGCNNNTC CACTACATGC NTCANCACTG TNTGTGTNAC NTCATNAACT 180 

GTCNGNAATA GGGGCNCATA ACTACAGAAA TGCANTTCAT ACTGCHCCA NTGCCATCNG 240 

CGTGTGGCCT TNCCTACTCT TCTTNTATTC CAAGTAGCAT CTCTGGANTG CTTCCCCACT 300 



CTCCACATTG TTGCAGCNAT AAT 

(2) INFORMATION FOR SEQ ID NO: 152: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 311 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEONESS: single 

(D) TOPOLOGY: linear 



323 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:152: 



TCAAGATTCC ATAGGCTGAC CAGTCCAAGG AGAGTTGAAA TCATGAAGGA 6AGTCTATCT 



60 
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G6AGAGAGCT GTAGTTTTGA GGGTTGCAAA 6ACTTAGGAT GGAGTTGGTG GGTGTGGTTA 
GTCTCTAAGG TTGATTTTGT TCATAAATTT CATGCCCTGA ATGCCTTGCT TGCCTCACCC 
TGGTCCAAGC CTTAGTGAAC ACCTAAAAGT CTCTGTCTTC TTGCTCTCCA AACTTCTCCT 
GAGGATTTCC TCAGATTGTC TACATTCA6A TCGAAGCCAG TTGGCAAACA AGATGCAGTC 
CAGAGGGTCA G 

(2) INFORMATION FOR SEQ ID NO: 153: 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 332 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEONESS: single 

(D) TOPOLOGY: linear 

(xD SEQUENCE DESCRIPTION: SEQ ID NO: 153: 
CAAGATTCCA TAGGCTGACC AGGAGGCTAT TCAAGATCTC TGGCAGTTGA GGAAGTCTCT 
TTAAGAAAAT AGTTTAAACA ATTTGTTAAA ATTTTTCTGT CTTACTTCAT TTCTGTAGCA 
GTTGATATCT G6CTGTCCTT TTTATAATGC AGA6TQGGAA CTTTCCCTAC CATGTTTGAT 
AAATGTTGTC CAGGCTCCAT TGCCAATAAT GTGHGTCCA AAATGCCTGT TTAGTTTTTA 
AA6ACGGAAC TCCACCCTTT GCTTGGTCTT AAGTATGTAT GGAATGTTAT GATAGGACAT 
AGTAGTAGCG GTGGTCAGCC TATGGAATCT TG 
(2) INFORMATION FOR SEQ ID NO: 154: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 345 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(x1) SEQUENCE DESCRIPTION: SEQ ID NO: 154: 

TCAAGATTCC ATAGGCTGAC CTGGACAGAG ATCTCCTGGG TCTGGCCCAG GACAGCAGGC 60 

TCAAGCTCAG TGGAGAAGGT TTCCATGACC CTCAGATTCC CCCAAACCTT GGATTGGGTG 120 

ACATTGCATC TCCTCAGAGA GGGAGGAGAT 6TANGTCT6G GCTTCCACAG GGACCTGGTA 180 

TTHAGGATC AGGGTACCGC TGGCCTGAGG CHGGATCAT TCANAGCCTG GGGGTGGAAT 240 

GGCTGGCAGC CTGTGGCCCC ATTGAAATAG GCTCTGGGGC ACTCCCTCTG TTCCTANTTG 300 

AACTTGGGTA AGGAACAGGA ATGTGGTCAN CCTATGGAAT CTTGA 345 

(2) INFORMATION FOR SEO ID NO: 155: 

(1) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 295 base pairs 
(8) TYPE: nucleic add 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(x1) SEQUENCE DESCRIPTION: SEQ ID NO: 155: 
GACGCTTGGC CACTTGACAC ATTAAACAGT TTTGCATAAT CACTANCATG TAHTCTAGT 60 
TTGCTGTCTG CTGTGATGCC CTGCCCTGAT TCTCTGGCGT TAATGAT6GC AAGCATAATC 120 
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AAACGCTGTT CTGTTAATTC CAAGTTATAA CTGGCATTGA TTAAAGCATT ATCHTCACA 180 

ACTAAACTGT TCTTCATANA ACA6CCCATA TTATTATCAA ATTAAGAGAC AATGTATTCC 240 

AATATCCTTT ANGGCCAATA TATTTNATGT CCCTTAATTA AGAGCTACTG TCCGT 295 
(2) INFORMATION FOR SEQ ID NO: 156: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 406 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEO ID NO: 156: 

GACGCTTGGC CACTTGACAC TGCAGTGGGA AAACCAGCAT GAGCCGCTGC CCCCAAGGAA 60 

CCTCGAAGCC CAGGCA6AGG ACCAGCCATC CCAGCCTGCA GGTAAAGTGT GTCACCTGTC 120 

AGGTGGGCTT GGGGTGAGTG GGTGGGGGAA GTGTGTGTGC AAAGGGGGTG TNAATGTNTA 180 

TGCGTGTGAG CATGAGTGAT GGCTAGTGTG ACTGCATGTC AGGGAGTGTG AACAAGCGTG 240 

CGGGGGTGTG TGTGCAAGTG CGTATGCATA TGAGAATATG TGTCTGTGGA TGAGTGCATT 300 

TGAAAGTCTG TGTGTGTGCG TGTGGTCATG ANGGTAANTT ANTGACTGCG CAGGATGTGT 360 

GAGTGTGCAT GGAACACTCA NTGTGTGTGT CAAGTGGCCN ANCGTC 406 
(2) INFORMATION FOR SEQ ID NO: 157: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 208 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(x1) SEQUENCE DESCRIPTION; SEQ ID NO: 157: 

TGACGCTTGG CCACTTGACA CACTAAAGGG TGTTACTCAT CACTTTCTTC TCTCCTCGGT 60 

GGCATGTGAG TGCATCTATT CACHGGCAC TCATTTGTTT GGCAGTGACT GTAANCCANA 120 

TCTGATGCAT ACACCAGCTT GTAAATTGAA TAAAT6TCTC TAATACTATG TGCTCACAAT 180 



ANGGTANGGG TGAGGAGAA6 GGGAGAGA 
(2) INFORMATION FOR SEQ ID NO: 158: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 547 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



208 



(x1) SEQUENCE DESCRIPTION: SEQ ID NO: 158: 

CTTCAACCTC CTTCAACCTC CTTCAACCTC CTGGATTCAA ACAATCATCC CACCTCAGAC 60 

TCCTTAGTAG CTGAGACTAC AGACTCACGC CACTACATCT GGCTAAATTT TTGTAGAGAT 120 

AGGGTTTCAT CATGTTGCCC T6GCTGGTCT CAAACTCCTG ACCTCAAGCA ATGTGCCCAC 180 

CTCAGCCTCC CAAAGTGCTG GGAHACAGG CATAAGCCAC CATGCCCAGT CCATNTTTAA 240 

TCTTTCCTAC CACATTCTTA CCACACTTTC TTTTATGTTT AGATACATAA ATGCTTACCA 300 

TTATGATACA ATTGCCCACA GTATTAA6AC AGTAACAT6C TGCACAGGTT T6TA6CCTAG 360 
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GAACAGTAGG CAATACCACA TAGCHAGGT GTGTGGTAGA CTATACCATC TAGGTTTGTG 420 

TAAGTTACAC TTTATGCTGT TTACACAATG ACAAAACCAT CTAATGATGC ATTTCTCAGA 480 

ATGTATCCTT GTCAGTAAGC TATGAT6TAC AGGGAACACT GCCCAAGGAC ACAGATATTG 540 

TACCTGT 547 
(2) INFORMATION FOR SEQ ID NO: 159: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 203 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(x1) SEQUENCE DESCRIPTION: SEQ ID NO: 159: 

GCTCCTCTTG CCTTACCAAC TCACCCAGTA TGTCAGCAAT TTTATCRGCT TTACCTACGA 60 

AACAGCCTGT ATCCAAACAC TTAACACACT CACCT6AAAA 6TTCAGGCAA CAATCGCCTT 120 

CTCATGGGTC TCTCTGCTCC AGTTCTGAAC CHTCTCTTT TCCTAGAACA TGCATTTARG 180 

TCGATAGAAG TTCCTCTCAG TGC 203 
(2) INFORMATION FOR SEQ ID NO: 160: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 402 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(x1) SEQUENCE DESCRIPTION: SEQ ID N0:160: 
TGTAAGTCGA GCAGTGTGAT GGGTGGAACA GGGTTGTAAG CAGTAATTGC AAACTGTATT 60 
TAAACAATAA TAATAATATT TAGCATHAT AGAGCACTTT ATATCTTCAA AGTACTTGCA 120 
AACATTAYCT AATTAAATAC CCTCTCTGAT TATAATCTGG ATACAAATGC ACTTAAACTC 180 
AG6ACAGGGT CATGAGARAA GTATGCATTT GAAAGTTG6T GCTAGCTATG CTTTAAAAAC 240 
CTATACAATG ATG6GRAAGT TAGAGTTCAG ATTCTGTTGG ACTGTTTTTG TGCATTTCAG 300 
TTCAGCCTGA TGGCAGAATT AGATCATATC TGCACTC6AT GACTYTGCTT GATAACTTAT 360 
CACTGAAATC TGAGTGTTGA TCATCACACT GCTCGACTTA CA 402 
(2) INFORMATION FOR SEQ ID NO: 161: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 193 base pairs 

(B) TYPE: nucleic add 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:161: 

AGCATGTTGA GCCCAGACAC TGACCAGGAG AAAAACCAAC CAATAGAAAC ACGCCCAGAC 60 

ACTGACCAGG AGAAAAACCA ACCAATAAAA ACAGGCCCGG ACATAAGACA AATAATAAAA 120 

TTAGCGGACA AGGACATGAA AACAGCTATT GTAAGAGCGG ATATAGTGGT GTGTGTCTGG 180 

GCTCAACATG CTA 193 
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(2) INFORMATION FOR SEQ ID NO: 162: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 147 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:16?: 
TGTTGAGCCC AGACACTGAC CAGGAGAAAA ACCAACCAAT AAAAACAGGC CCGGACATAA 60 
GACAAATAAT AAAATTAGCG GACAAGGACA TGAAAACAGC TATTGTAAGA GCGGATATAG 120 
TGGTGT6TGT CTGGGCTCAA CATGCTA 147 
(2) INFORMATION FOR SEQ ID NO: 163: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 294 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:163: 

TAGCATGTTG AGCCCAGACA CAAATCTTTC CTTAAGCAAT AAATCATTTC TGCATATGTT 60 

TTTAAAACCA CAGCTAAGCC ATGATTATTC AAAAGGACTA TTGTATTGGG TATTTTGATT 120 

TGGGTTCTTA TCTCCCTCAC ATTATCHCA TTTCTATCAT TGACCTCTTA TCCCAGAGAC 180 



TCTCAAACTT TTATGTTATA CAAATCACAT TCTGTCTCAA AAAATATCTC ACCCACTTCT 



240 
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CTTCTGTTTC TGCGTGTGTA T6TGTGTGTG TGTGTGTCTG GGCTCAACAT GCTA 294 
(2) INFORMATION FOR SEQ ID NO: 164: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 412 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:164: 

CGGGATTGGC TTTGAGCTGC AGATGCTGCC TGTGACCGCA CCCG6CGTGG AACAGAAAGC 60 

CACCTGGCTG CAAGTGCGCC AGAGCCGCCC TGACTACGTG CTGCTGTGGG GCTGGGGCGT 120 

GATGAACTCC ACCGCCCTGA AG6AAGCCCA 6GCCACCGGA TACCCCCGCG ACAA6ATGTA 180 

CGGCGTGTGG TGGGCCGGTG CGGAGCCCGA TGTGCGTGAC GTGGGCGAAG GCGCCAAGGG 240 

CTACAACGCG CTGGCTCTGA ACGGCTACGG CACGCAGTCC AAGGTGATCC ANGACATCCT 300 

GAAACACGTG CACGACAAG6 GCCAGGGCAC GGGGCCCAAA GACGAAGTGG GCTCGGTGCT 360 

GTACACCCGC GGCGTGATCA TCCAGATGCT GGACAAQGTG TCAATCACTA AT 412 
(2) INFORMATION FOR SEQ ID NO: 165: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 361 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(x1) SEQUENCE DESCRIPTION: SEO ID N0:165: 

TTGACACCTT GTCCAGCATC TGCATCTGAT GAGAGCCTCA GATGGCTACC ACTAATGGCA 60 

GAAGGCAAAG GAGAACAGGC ATTGTATGGC AAGAAAGGAA GAAAGAGAGA GGGGA6AAAG 120 

GTGCTAGGTT CTTTTCAACA ACCAGTTCTT GATGGAACTG AGAGTAAGAG CTCAAGGCCA 180 

GGTGTGGTGA CTCCAACCAG TAATCCCAAC ATTTTAGGAG GCT6AGGCAG GCAGATGTCT 240 

TGACCCCATG AGTTTGTGAC CAGCCTGAAC AACATCATGA GACTCCATCT CTACAATAAT 300 

TACAAAAATT AATCAGGCAT TGTGGTATGC CCTGTAGTCC CAGATGCTGG ACAAGGTGTC 360 

A 361 
(2) INFORMATION FOR SEQ ID NO: 166: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 427 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 166: 



TWGACTGACT CATGTCCCCT ACACCCAACT ATCTTCTCCA 6GTGGCCAGG CATGATAGAA 60 



TCTGATCCTG ACTTA6GGGA ATATTTTCTT TTTACTTCCC ATCTTGATTC CCTGCCGGTG 120 



AGTTTCCTGG TTCAGGGTAA GAAAGGAGCT CAGGCCAAAG TAATGAACAA ATCCATCCTC 180 



ACA6AC6TAC AGAATAAGAG AACWTGGACW TAGCCAGCAG AACMCAAKTG AAAMCAGAAC 240 



MCTTAMCTAG GATRACAAMC MCRRARATAR KTGCYCMCMC WTATAATAGA AACCAAACH 300 
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6TATCTAATT AAATATTTAT CCACY6TCAG GGCATTAGTG GTTTTGATAA ATACGCTTTG 360 
GCTAGGATTC CTGAGGTTAG AATGGAARAA CAATT6CAMC GAGGGTAGGG GACATGAGTC 420 



AKTCTAA 

(2) INFORMATION FOR SEO ID NO: 167: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 500 base pairs 

(B) TYPE: nucleic acid 

(C) STRANOEDNESS : single 

(D) TOPOLOGY: linear 



427 



(x1) SEQUENCE DESCRIPTION: SEO ID NO: 167: 

AACGTCGCAT GCTCCCGGCC GCCATGGCCG CGGGATAGAC TGACTCATGT CCCCTAAGAT 60 

AGAGGAGACA CCTGCTAGGT GTAAGGAGAA GATGGTTAGG TCTACGGAGG CTCCAGGGTG 120 

GGAGTAGTTC CCTGCTAAGG GAGGGTAGAC TGTTCAACCT GTTCCTGCTC CGGCCTCCAC 180 

TATA6CAGAT GCGAGCAGGA GTAGGAGAGA GGGAGGTAAG AGTCAGAAGC HATGTTGTT 240 

TATGCGGGGA AACGCCRTAT CGGGGGCAGC CRAGTTATTA GGGGACANTR TAGWYARTCW 300 

AGNTAGCATC CAAAGCGNGG GAGTTNTCCC ATATGGTTGG ACCTGCAGGC GGCCGCATTA 360 

GTGATTAGCA TGTGAGCCCC AGACAC6CAT AGCAACAAG6 ACCTAAACTC A6ATCCTGT6 420 

CTGATTACTT AACATGAATT ATTGTATTTA TTTAACAACT TTGAGTTATG AGGCATATTA 480 
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TTAGGTCCAT ATTACCTGGA 500 
(2) INFORMATION FOR SEO ID NO: 168: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 358 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(0) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:168: 

TTCATCGCTC GGTGACTCAA GCCTGTAATC CCAGAACTTT GGGAG6CCGA GGGGAGCAGA 60 

TCACCTGAGG TTGGGAGTTT GAGACCAGCC TGGCCAACAT GGTGACAACC CGTCTCTGCT 120 

AAAAATACAA AAATTAGCCA AGCATGGTGG CATGCACTTG TAATCCCAGC TACTCGGGAG 180 

GCTGAGGCAG GAGAATCACT TGAGGCCAGG AGGCAGAGGT TGCAGTGAG6 CAGAGGHGA 240 

GATCATGCCA CT6CACTCCA GCCTGGGCAA CAGAGTAAGA CTCCATCTCA AAAAAAAAAA 300 

AAAAAAAGAA TGATCAGAGC CACAAATACA GAAAACCTTG AGTCACCGAG CGATGAAA 358 
(2) INFORMATION FOR SEQ ID NO: 169: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1265 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:169: 
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TTCTGTCCAC ACCAATCTTA GAGCTCTGAA AGAATTTGTC TTTAAATATC TTTTAATAGT 60 

AACATGTATT TTATG6ACCA AAHGACATT TTCGACTATT TTTTCCCAAA AAAAGTCA6G 120 

TGAATTTCAG CACACTGAGT TGGGAATTTC TTATCCCAGA AGWCGGCACG AGCAATTTCA 180 

TATTTATTTA A6ATT6ATTC CATACTCCGT TTTCAAG6AG AATCCCTGCA GTCTCCTTAA 240 

AGGTAGAACA AATACTTTCT ATTTTTTTTT CACCATTGTG GGATTGGACT TTAAGAGGTG 300 

ACTCTAAAAA AACAGAGAAC AAATATGTCT CAGTTGTATT AAGCAC6GAC CCATATTATC 360 

ATATTCACTT AAAAAAATGA TTTCCTGTGC ACCTTTTGGC AACTTCTCTT TTCAATGTAG 420 

GGAAAAACTT AGTCACCCTG AAAACCCACA AAATAAATAA AACTTGTAGA T6TGGGCAGA 480 

ARGTTTG6GG GTGGACATTG TATGTGTTTA AATTAAACCC T6TATCACTG A6AAGCT6TT 540 

GTATGGGTCA GAGAAAATGA ATGCTTAGAA GCTGTTCACA TCTTCAAGAG CAGAAGCAAA 600 

CCACATGTCT CAGCTATATT ATTATTTAH TTTTATGCAT AAA6TGAATC ATTTCTTCTG 660 

TATTAATTTC CAAAGGGTTT TACCCTCTAT TTAAATGCTT TGAAAAACAG TGCATTGACA 720 

ATGGGTTGAT ATTTTTCTTT AAAAGAAAAA TATAAHATG AAAGCCAAGA TAATCTGAAG 780 

CCTGTTnAT TTTAAAACTT TTTATGTTCT GIGGHGATG TTGTTTGTTT GTHGTITCT 840 

ATTTTGTTGG TTTTTTACTT TGTTTTTTGT TTTGTTTTGT TTTGGnTDG CATACTACAT 900 

GCAGTTTCTT TAACCAATGT CTGTTTGGCT AATGTAATTA AAGTTGTTAA TTTATATGAG 960 

TGCATTTCAA CTATGTCAAT GGTHCHAA TATTTATTGT GTAGAAGTAC TG6TAATTTT 1020 

TTTATTTACA ATATGTTTAA AGAGATAACA GTTTGATATG TTTTCATGTG TTTATAGCAG 1080 
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AAGTTATTTA TTTCTATGGC ATTCCAGCGG ATATTTTGGT GTTTGCGAGG CATGCAGTCA 1140 

ATATTTTGTA CAGTTAGTGG ACAGTATTCA GCAACGCCTG ATA6CTTCTT TGGCCTTATG 1200 

TTAAATAAAA AGACCTGTTT GGGATGTAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 1260 

AAAAA 1265 
(2) INFORMATION FOR SEQ ID NO: 170: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3B3 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(x1) SEQUENCE DESCRIPTION: SEQ ID NO: 170: 

TGTAAGTCGA GCAGTGTGAT GACGATATTC TTCTTAHAA TGTGGTAATT GAACAAATGA 60 

TCTGT6ATAC TGATCCTGAG CTAGGAGGCG CTGTTCAGTT AATGGGACTT CTTCGTACTC 120 

TAATTGATCC AGAGAACATG CTGGCTACAA CTAATAAAAC CGAAAAAAGT GAATTTCTAA 180 

ATTTHTCTA CAACCATTGT ATGCATGTTC TCACAGCACC ACTTTTGACC AATACTTCAG 240 

AAGACAAATG TGAAAAGGAT AATATAGTTG GATCAAACAA AAACAACACA ATHGTCCCG 300 

ATAATTATCA AACAGCACAG CTACTTGCCT TAATTTTAGA GHACTCACA TTTTGT6TGG 360 

AACATCACAC TGCTC6ACTT ACA 383 
(2) INFORMATION FOR SEQ ID NO: 171: 
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(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 383 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEO ID N0:171: 

TGGGCACCTT CAATATCGCA AGTTAAAAAT AATGTTGAGT TTATTATACT TTTGACCTGT 60 

TTAGCTCAAC AGGGTGAAGG CATGTAAAGA ATGTGGACTT CTGAGGAATT TTCTTTTAAA 120 

AAGAACATAA TGAAGTAACA THTAATTAC TCAAGGACTA CTTTTGGTTG AAGTTTATAA 180 

TCTAGATACC TCTACTTTTT GTHTTGCTG TTCGACAGTT CACAAAGACC TTCAGCAATT 240 

TACAGGGTAA AATCGTTGAA GTAGTGGAGG TGAAACTGAA ATTTAAAATT ATTCTGTAAA 300 

lACTATAGGG AAAGAGGCTG AGCTTAGAAT CTTTTGGTTG TTCATGTGTT CTGTGCTCTT 360 



ATCATCACAC TGCTCGACH ACA 

(2) INFORMATION FOR SEQ ID NO: 172: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 699 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



383 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 172: 



TCGG6TGATG CCTCCTCAGG CTTGTCGTTA GTGTACACAG AGCTGCTCAT GAAGC6ACAG 60 
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CGGCTGCCCC TGGCACTTCA GAACCTCTTC CTCTACACTT TTGGTGCGCT TCTGAATCTA 120 



GGTCTGCATG CTGGCGGCGG CTCTGGCCCA GGCCTCCTGG AAAGHTCTC AGGATGGGCA 180 



GCACTCGTGG TGCTGAGCCA GGCACTAAAT G6ACTGCTCA TGTCTGCTGT CAT6GA6CAT 240 



GGCAGCAGCA TCACACGCCT CTTTGTGGT6 TCCTGCTCGC TGGTQGTCAA CGCCGTGCTC 300 



TCAGCAGTCC TGCTACGGCT GCAGCTCACA GCC6CCTTCT TCCTGGCCAC ATTGCTCATT 360 



6GCCTGGCCA TGC6CCTGTA CTATGGCAGC CGCTAGTCCC TGACAACTTC CACCCTGATT 420 



CCGGACCCTG TAGATTGGGC GCCACCACCA GATCCCCCTC CCAGGCCTTC CTCCCTCTCC 480 



CATCAGCGGC CCTGTAACAA GTGCCTTGTG AGAAAAGCTG GAGAAGTGAG GGCAGCCAGG 540 



TTATTCTCTG GAGGTTGGTG GATGAAGGGG TACCCCTAGG AGATGT6AAG TGTGGGTTTG 600 



GTTAAGGAAA TGCTTACCAT CCCCCACCCC CAACCAAGH NTTCCAGACT AAAGAATTAA 660 



GGTAACATCA ATACC TAGGC CTGAGGAGGC ATCACCCGA 



699 



(2) INFORMATION FOR SEQ ID N0:173: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 701 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(0) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 173: 



TCG6GTGAT6 CCTCCTCAGG CCAGATCAAA CTTGGGGTTG AAAACTGTGC AAAGAAATCA 60 



ATGTCGGAGA AAGAATHTG CAAAAGAAAA ATGCCTAATC AGTACTAATT TAATAGGTCA 120 
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CATTAGCAGT GGAAGAAGAA ATGTTGATAT TTTATGTCAG CTATTTTATA ATCACCAGAG 
TGCTTAGCTT CATGTAAGCC ATCTCGTATT CATTAGAAAT AAGAACAATT TTATTCGTCG 
GAAAGAACTT TTCAATTTAT AGCATCTTAA HGCTCAGGA TTTTAAATTT TGATAAAGAA 
AGCTCCACTT TT6GCAG6AG TAGGGGGCAG GGAGAGAGGA GGCTCCATCC ACAAGGACAG 
AGACACCAGG GCCAGTAGGG TAGCTGGTG6 CTGGATCAGT CACAACGGAC TGACTTATGC 
CATGAGAAGA AACAACCTCC AAATCTCAGT TGCTTAATAC AACACAAGCT CATTTCTTGC 
TCACGTTACA TGTCCTATGT AGATCAACAG CAGGTGACTC AGGGACCCAG GCTCCATCTC 
CATATGAGCT TCCATAGTCA CCAGGACACG GGCTCTGAAA GTGTCCTCCA TGCAGGGACA 
CATGCCTCTT CCTTTCATTG GGCAGAGCAA GTCACTTAT6 GCCAGAAGTC ACACTGCAGG 
GCAGTGCCAT CCTGCTGTAT GCCTGA6GAG GCATCACCCG A 
(2) INFORMATION FOR SEQ ID N0:174: 

(1) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 700 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



PCTAJS97/00485 

180 
240 
300 
360 
420 
480 
540 
600 
660 
701 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:174: 
TC6GGTGATG CCTCCTCANG CCCCTAAATC AGAGTCCAGG GTCAGA6CCA CAGGAGACAG 
GGAAAGACAT AGATTTTAAC CGGCCCCCTT CAGGAGATTC TGAGGCTCAG TTCACTTTGT 



60 



120 
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TGCAGTTTGA ACAGAGGCAG CAAGGCTAGT GGTTAGGGGC ACGGTCTCTA AAGCTGCACT 180 

GCCTGGATCT GCCTCCCAGC TCTGCCAGGA ACCAGCTGCG TGGCCTTGAG CTGCTGACAC 240 

GCAGAAAGCC CCCTGTGGAC CCAGTCTCCT CGTCTGTAAG ATGAGGACAG GACTCTAGGA 300 

ACCCTTTCCC TIGGHIGGC CTCACTTTCA CAGGCTCCCA TCTTGAACTC TATCTACTCT 360 

TTTCCTGAAA CCTTGTAAAA GAAAAAAGTG CTAGCCTGGG CAACATGGCA AAACCCTGTC 420 

TCTACAAAAA ATACAAAAAT TAGHGGGTG TGGTGGCATG TGCCTGTAGT CCCAGCCACT 480 

TGGGAGGT6C TGAGGTGGGA GGATCACTTG AGCCCGGGAG GTGGAGGTTG CAGTGAGCCA 540 

AGATCATGCC ACTGCACTCC AGCCTGAGTA ATAGAGTAAG ACTCTGTCTC AAAAACAACA 600 

ACAACAACAG TGAGTGTGCC TCTGTTTCCG GGTTGGATGG GGCACCACAT TTATGCATCT 660 

CTCAGATTTG GACGCTGCAG CCTGAGGAGG CATCACCCGA 700 
(2) INFORMATION FOR SEQ ID N0:175: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 484 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:175: 

TATAQGGC6A ATTGGGCCCG AGTTGCATGN TCCCGGCCGC CATGGCCGCG GGATTCGGGT 60 

GATGCCTCCT CAGGCTTGTC TGCCACAAGC TACTTCTCTG AGCTCAGAAA GTGCCCCHG 120 

ATGAGGGAAA ATGTCCTACT GCACTGCGAA TTTCTCAGTT CCATTTTACC TCCCAGTCCT 180 



wo 97/25426 PCTAJS97y00485 

151 

CCTTCTAAAC CAGTTAATAA ATTCATTCCA CAAGTATTTA CTGATTACCT GCTTGTGCCA 240 

GGGACTATTC TCAGGCTGAA GAAGGTGGGA GGGGAGGGCG GAACCTGAGG AGCCACCTGA 300 

GCCAGCTTTA TATTTCAACC ATGGCTGGCC CATCTGAGAG CATCTCCCCA CTCTCGCCAA 360 

CCTATCGGGG CATAGCCCAG GGATGCCCCC AGGCGGCCCA GGTTAGATGC GTCCCTTTGG 420 

CTTGTCAGTG ATGACATACA CCTTAGCTGC TTAGCTGGTG CTGGCCTGAG GAGGCATCAC 480 

CCGA 484 
(2) INFORMATION FOR SEO ID NO: 176: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 432 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEONESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:176: 

TCGGGTGATG CCTCCTCAGG GCTCAAGGGA TGAGAA6TGA CTTCTTTCTG 6AGGGACCGT 60 

TCATGCCACC CAGGATGAAA ATGGATAGGG ACCCACTTGG AGGACTTGCT GATATGHTG 120 

GACAAATGCC AGGTAGCGGA ATTGGTACTG GTCCAGGAGT TATCCAGGAT AGATTTTCAC 180 

CCACCATGGG ACGTCATCGT TCAAATCAAC TCTTCAATGG CCATGGGGGA CACATCATGC 240 

CTCCCACACA ATCGCAGHT GGAGAGATGG GAGGCAAGH TATGAAAAGC CAGGGGCTAA 300 

GCCAGCTCTA CCATAACCAG AGTCAGGGAC TCTTATCCCA GCTGCAAGGA CAGTCGAAGG 360 
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ATATGCCACC TCGGTTTTCT AAGAAAGGAC AGCTTAATGC AGATGAGATT AGCCTGAGGA 420 
GGCATCACCC GA 432 
(2) INFORMATION FOR SEQ ID NO: 177: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 788 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 177: 

TAGCATGTTG AGCCCAGACA CA6TAGCATT TGTGCCAATT TCTGGHGGA ATGGTGACAA 60 

CATGCTGGAG CCAAGTGCTA ACATGCCTTG GTTCAAGGGA T66AAA6TCA CCCGTAAGGA 120 

TGGCAATGCC AGTGGAACCA CGCTGCTTGA GGCTCTGGAC TGCATCCTAC CACCAACTC6 180 

CCCAACTGAC AAGCCCTTGC GCCTGCCTCT CCAGGATGTC TACAAAATTG GTGGTATTGG 240 

TACTGTTCCT GTTGGCCGAG TGGAGACTGG TGTTCTCAAA CCCGGTATGG TGGTCACCTT 300 

TGCTCCAGTC AACGTTACAA CGGAAGTAAA ATCTGTCGAA ATGCACCATG AAGCTTTGAG 360 

TGAAGCTCTT CCT66GGACA ATGT6GGCTT CAATGTCAAG AATGTGTCT6 TCAAG6ATGT 420 

TC6TCGTGGC AAC6TTGCTG GTGACAGCAA AAATGACCCA CCAAT6GAAG CAGCTGGCTT 480 

CACTGCTC/tfi GTGATTATCC TGAACCATCC AGGCCAAATA AGTGCCGGCT ATGCCCCTGT 540 

AHGGATTGC CACACGGCTC ACATTGCATG CAAGHTGCT GAGCTGAAGG AAAAGATTGA 600 

TCGCC6TTCT GGTAAAAAGC TGGAAGATGG CCCTAAATTC TTGAAGTCTG GTGATGCTGC 660 
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CATTGTTGAT ATG6TTCCTG GCAAGCCCAT GTGTGTTGAG AGCTTCTCAG ACTATCCACC 720 
TTTGGGTCGC TTTGCTGTTC GTGATATGAG ACA6ACAGTT GCGGTG6GTG TCTGG6CTCA 780 



ACATGCTA 

(2) INFORMATION FOR SEQ ID NO: 178: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 786 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(0) TOPOLOGY: linear 



788 



(xl) SEQUENCE DESCRIPTION: SEQ ID N0:178: 

TAGCATGTT6 AGCCCAGACA CCTGTGTTTC TQGGAGCTCT GGCAGTGGCG GAHCATAGG 60 

CACTTGGGCT GCACTTTGAA TGACACACTT GGCTTTATTA GAHCACTAG TTTTTAAAAA 120 

ATTGTTGTTC 6TTTCTTTTC ATTAAAGGTT TAATCAGACA GATCAGACAG CATAATTTTG 180 

TATTTAATGA CAGAAACGTT GGTACATTTC HCATGAATG A6CTTGCATT CTGAAGCAA6 240 

AGCCTACAAA AGGCACnGT TATAAATGAA AGTTCTGGCT CTAGAGGCCA GTACTCTGGA 300 

GTTTCAGAGC AGCCAGTGAT TGTTCCAGTC AGTGATGCCT AGTTATATAG AGGAGGAGTA 360 

CACTGTGCAC TCTTCTAGGT GTAAG6GTAT GCAACTTTGG ATCTTAAAAT TCTGTACACA 420 

TACACACTTT ATATATATGT ATGTAT6TAT GAAAACATGA AATTAGTTTG TCAAATATGT 480 

GTGTGHTAG TATTTTAGCT TAGT6CAACT ATnCCACAT TATTTATTAA AHGATCTAA 540 
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GACACTTTCT TGTTGACACC TT6AATATTA AT6TTCAAGG GTGCAATGTG TATTCCTTTA 600 



GATTGTTAAA GCTTAATTAC TATGATTTGT AGTAAATTAA CTTTTAAAAT GTATTTGAGC 660 



CCTTCTGTAG TGTC6TAGGG CTCTTACAGG 6TGG6AAA6A TTTTAATTTT CCAGTTGCTA 720 



ATT6AACAGT ATGGCCTCAT TATATATHT GATTTATAGG AGHTGIGTC TGGGCTCAAC 780 



AT6CTA 



786 



(2) INFORMATION FOR SEQ ID NO: 179; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 796 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:179: 



TAGCATGTTG AGCCCAGACA CTGGTTACAA GACCAGACCT GCTTCCTCCA TATGTAAACA 60 



GCTTTTAAAA AGCCAGTGAA CCTTTTTAAT ACTTTGGCAA CCTTCTTTCA CAGGCAAAGA 120 



ACACCCCCAT CCGCCCCHG HTGGAGTGC AGA6TTTGGC TTTG6TTCTT TGCCTTGCCT 180 



GGAGTATACT TCTAAHCCT GTTGTCCTGC ACAAGCTGAA TACCGAGCTA CCCACCGCCA 240 



CCCAGGCCAG GTTTCCACTC AHTATTACT TTATGTTTCT GTTCCATTGC TGGTCCACAG 300 



AAATAAGTTT TCCTTTG6AG GAAT6TGATT ATACCCCTTT AATTTCCTCC TTHGCTTH 360 



TTTTAATATC ATTGGTATGT GHTOGCCCA GAGGAAACT6 AAATTCACCA TCATCTTGAC 420 



TGGCAATCCC ATTACCATGC TTTTTTTAAA AAACGTAATT TTTCTTGCCT TACATTGGCA 480 
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GAGTAGCCCT TCCTGGCTAC TGGCTTAATG TAGTCACTCA GTTTCTAGGT GGCATTAG6C 540 

ATGAGACCTG AAGCACAGAC TGTCTTACCA CAAAAGGTGA CAAGATCTCA AACCTTAGCC 600 

AAAGGGCTAT GTCAGGniC AATGCTATCT GCTTCTGTTC CTGCTCACTG TTCTG6ATTT 660 

TGTCCTTCTT CATCCCTAGC ACCAGAATTT CCCAGTCTCC CTCCCTACCT TCCCTTGTTT 720 

TAATTCTAAT CTATCAGCAA AATAACTTTT CAAAIGTHT AACCGGTATC TCCAT6TGTC 780 

TGGGCTCAAC ATGCTA 796 
(2) INFORMATION FOR SEQ ID NO: 180: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 488 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(0) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 180; 

GGATGTGCTG CAA6GCGATT AAGTTGGGTA ACGCCAGGGT TTTCCCAGTC ACGACGTTGT 60 

AAAACGACGG CCAGTGAAn GTAATACGAC TCACTATAGG GCGAAHGGG CCCGACGTCG 120 

CATGCTCCCG 6CCGCCATGG CCGCGGGATA GCATGTTGAG CCCAGACACC TGCAGGTCAT 180 

TTGGAGAGAT TTTTCACGTT ACCAGCHGA TGGTCTTTn CAGGAGGW3A GACACTGAGC 240 

ACTCCCAAGG TGAGGHGAA GATHCCTCT AGATAGCCGG ATAAGAAGAC TAGGAGGGAT 300 

GCCTAGAAAA TGAHAGCAT GCAAATTTCT ACCT6CCATT TCAGAACTGT 6TGTCA6CCC 360 
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ACATTCAGCT GCTTCTTGT6 AACTGAAAAG AGAGAGGTAT TGAGACTTTT CTGATGGCCG 420 
CTCTAACATT GTAACACAGT AATCTGTGT6 T6T6TGG6TG T6T6T6TGTG TCTG6GCTCA 480 
ACATGCTA 488 
(2) INFORMATION FOR SEQ ID NO: 181: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 317 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION; SEQ ID N0:181: 

TAGCATGTTG AGCCCAGACA CGGCGACGGT ACCTGAT6AG TGGGGTGATG GCACCTGT6A 60 

AAAG6AGGAA CGTCATCCCC CATGATATTG GGGACCCAGA TGATGAACCA TGGCTCCGCG 120 

TCAATGCATA TTTAATCCAT GATACTGCTG ATTGGAAGGA CCTGAACCTG AAGTTTGTGC 180 

TGCAGGTTTA TCGGGACTAT TACCTCACGG GTGATCAAAA CTTCCTGAAG GACATGTGGC 240 

CTGT6TGTCT AGTAAGGGAT GCACATGCAG TGGCCAGTGT GCCAGGGGTA TGGHGGTGT 300 

CTGGGCTCAA CATGCTA 317 
(2) INFORMATION FOR SEQ ID NO: 182: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 507 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(x1) SEQUENCE DESCRIPTION: SEQ ID NO: 182: 

TA6CAT6TTG AGCCCAGACA CTGGCTGTTA GCCAAATCCT CTCTCAGCTG CTCCCTGTGG 60 

TTTGGTGACT CAGGATTACA GAGGCATCCT GTTTCAGGGA ACAAAAA6AT TTTAGCTGCC 120 

AGCAGAGAGC ACCACATACA TTAGAATGGT AAGGACTGCC ACCTCCTTCA AGAACAGGAG 180 

TGAGGGTGGT G6TGAATGGG AATGGAAGCC TGCATTCCCT GATGCATTTG TGCTCTCTCA 240 

AATCCTGTCT TA6TCTTA66 AAAGGAAGTA AAGTTTCAAG GACGGTTCCG AACTGCTTTT 300 

TGTGTCT6GG CTCAACATGC TATCCCGCGG CCATGGCGGC CGGGAGCATG CGAC6TCGGG 360 

CCCAATTCGC CCTATAGTGA GTCGTAHAC AATTCACTGG CCGTCGTTTT ACAACGTCGT 420 

GACTGGGAAA ACCCTGGCGT TACCCAACH AATCGCCTTG CA6CACATCC CCCTTTCCCA 480 

GCTGGCGTAA TANCGAAAAG GCCCGCA 507 
(2) INFORMATION FOR SEQ ID NO: 183: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 227 base pairs 

(B) TYPE: nucleic add 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:183: 
GATTTACGCT GCAACACTGT G6AGGTAGCC CTGGAGCAAG GCAG6CATGG ATGCTTCTGC 
AATCCCCAAA TGGAGCCTGG TATTTCAGCC AGGAATCTGA GCAGAGCCCC CTCTAATTGT 



60 



120 
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AGCAATGATA AGTTATTCTC TTTGTTCTTC AACCTTCCAA TAGCCTTGA6 CTTCCAGGGG 180 
AGTGTCGTTA ATCATTACAG CCTG6TCTCC ACAGTGTTGC AGCGTAA 227 
(2) INFORMATION FOR SEQ ID NO: 184: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 225 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 184: 

TTACGCTGCA ACACTGTGGA GCAGATTAAC ATCAGACTTT TCTATCAACA TGACTGGGGT 60 

TACTAAAAAG ACAACAAATC AATGGCTTCA AAAGTCTAAG GAATAATHC 6ATACTTCAA 120 

CTTTATAAAA CCT6ACAAAA CTATCAATCA AGCATAAAGA CAGATGAAGA ACATTTCCAG 180 

ATTTTGGCCA ATCAGATAH TTACCTCCAC AGTGTTGCAG C6TAA 225 
(2) INFORMATION FOR SEQ ID NO: 18b: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 597 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:185: 
GGCCCGACGT CGCATGCTCC CGGCCGCCAT GGCCGCGGGA TTCGTTAGGG TCTCTATCCA 60 
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CTGGGACCCA TAGGCTAGTC AGAGTATTTA GAGTTGAGTT CCTTTCTGCT TCCCAGAATT 120 

TGAAAGAAAA GGAGTGAGGT GATAGAGCTG AGAGATCAGA THGCCTCTG AAGCCTGTTC 180 

AAGATGTATG TGCTCAGACC CCACCACTGG GGCCTGTGGG TGAGGTCCTG GGCATCTATT 240 

T6AATGAATT GCT6AAGGGG AGCACTATGC CAAGGAAGGG GAACCCATCC TGGCACTGGC 300 

ACAGGG6TCA CCTTATCCAG TGCTCAGTGC TTCTTTGCTG CTACCTGGTT TTCTCTCATA 360 

TGTGAGGGGC AGGTAAGAAG AAGTGCCCRG TGTTGTGC6A GTTTTA6AAC ATCTACCAGT 420 

AA6TGGGGAA GTHCACAAA GCAGCAGCn TGTniGTGT ATTTTCACCT TCAGHAGAA 480 

GAGGAAGGCT GTGAGAT6AA TGHAGTTGA GTGGAAAAGA CGGGTAAGCT TAGTGGATAG 540 

AGACCCTAAC GAATCACTAG TGCGGCC6CC TTGCAGGTCG ACCATATGGG AGAGCTC 597 
(2) INFORMATION FOR SEQ ID NO: 186: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 597 base pairs 

(B) TYPE: nucleic acicj- 

(C) STRANDEONESS; single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 186: 

GGCCCGAAGT TGCATGHCC CGGCCGCCAT GGCCGCGGGA TTCGTTAGGG TCTCTATCCA 60 

CTACCTAAAA AATCCCAAAC ATATAACTGA ACTCCTCACA CCCAATTGGA CCAATCCATC 120 

ACCCCAGAGG CCTACAGATC CTCCTTTGAT ACATAA6AAA ATTTCCCCAA ACTACCTAAC 180 
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TATATCATTT TGCAAGATTT GniTACCM ATTTTGATGG CCTHCTGAG CTTGTCAGTG 240 

TGAACCACTA TTACGAACGA TC6GATATTA ACTGCCCCTC ACCGTCCAGG TGTAGCTG6C 300 

AACATCAAGT GCAGTAAATA TTCAnAAGT TTTCACCTAC TAA6GTGCTT AAACACCCTA 360 

GGGTGCCATG TCGGTAGCAG ATCTHTGAT TT6TTTTTAT TTCCCATAAG GGTCCTGTTC 420 

AAGGTCAATC ATACATGTAG TGT6AGCAGC TAGTCACTAT CGCATGACTT GGAGG6TGAT 480 

AATA6AGGCC TCCTTTGCTG TTAAAGAACT GTTGTCCCAG CCTGTCAAAG TGGATAGAGA 540 

CCCTAACGAA TCACTAGTGC GGCCGCCTGC AGGTCGACCA TATGGGAGAG CTCCCAA 597 

(2) INFORMATION FOR SEO ID NO: 187: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 324 base pairs 
(8) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPaOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEO ID NO: 187: 

TCGTTAGGGT CTCTATCCAC TTGCAGGTAA AATCCAATCC TGTGTATATC TTATAGTCTT 60 

CCATATGTAG TGGTTCAAGA GACTGCA6TT CCAGAAAGAC TAGCCGAGCC CATCCAT6TC 120 

TTCCACTTAA CCCTGCTTTG GGTTACACAT CHAACTTH CTGHCAAGT HCTCTGTGT 180 

AGTTTATAGC ATGAGTATTG GGAWAATGCC CTGAAACCTG ACATGAGATC TGGGAAACAC 240 

AAACTTACTC AATAAGAATT TCTCCCATAT THTATGATG GAAAAATTTC ACATGCACAG 300 

AGGAGTG6AT AGAGACCCTA ACGA 324 
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(2) INFORMATION FOR SEQ ID NO: 188: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 178 base pairs 
(8) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:188: 

GCGCGGGGAT TCGGGGTGAT ACCTCCTCAT GCCAAAATAC AACGTNTAAT HCACAACTT 60 

GCCTTCCAAT TTACGCATTT TCAATTTGCT CTCCCCATTT GTTGAGTCAC AACAAACACC 120 

ATTGCCCAGA AACATGTATT ACCTAACATG CACATACTCT TAAAACTACT CATCCCTT 178 

(2) INFORMATION FOR SEQ ID NO: 189: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 367 base pairs 

(8) TYPL nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY; linear 



(x1) SEQUENCE DESCRIPTION: SEQ ID NO: 189: 

TGACACCTTG TCCAGCATCT GACACAGTCT TGGCTCTTGG AAAATATTGG ATAAATGAAA 60 

ATGAAHTCT TTAGCAAGTG GTATAAGCTG AGAATATACG TATCACATAT CCTCATTCTA 120 

AGACACATTC AGTGTCCCTG AAATTAGAAT AGGACTTACA ATAAGT6TGT TCACHTCTC 180 

AATAGCTGTT ATTCAAHGA TGGTAGGCCT TAAAAGTCAA AGAAATGAGA GGGCATGTGA 240 
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AAAAAAGCTC AACATCACTG ATCATTAGAA AACTTCCATT CAAACCCCCA ATGAGATACC 300 

ATCTCATACC AGTCAGAATG GCTATTATTA AAAAGTCAAA AAATAACAGA TGCTGGACAA 360 

GGT6TCA 367 
(2) INFORMATION FOR SEQ ID NO: 190: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 369 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:190: 

GACACCTTGT CCAGCATCTG ACAACGCTAA CAGCCTGAGG AGATCTTTAT TTATTTATTT 60 

AGTTTTTACT CTGGCTAGGC AGATGGTGGC TAAAACATTC AHTACCCAT TTATTCATTT 120 

AATTGTTCCT GCAAGGCCTA T6GATAGAGT ATTGTCCAGC ACTGCTCTGG AAGCTAG6AG 180 

CATGGG6ATG AACAA6ATAG GCTACATCCT GTTCCCACAG AACTTCCACT TTAGTCTGGG 240 

AAACAGATGA TATATACAAA TATATAAATG AATTCAGGTA GTTTTAAGTA CGAAAAGAAT 300 

AAGAAAGCAG AGTCATGATT TANAAT6CTG GAAACAGGGG CTATTGCTTG AGATATTGAA 360 

GGT6CCCAA 369 

(2) INFORMATION FOR SEQ ID NO: 191: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 369 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEO ID N0:191: 

TGACACCTTG TCCAGCATCT GCACAGGGAA AAGAAACTAT TATCAGAGTG AACAGGCAAC 60 

CTACAGAAT6 GGAGAAAATT TTTGCAATCT ATCCATCTGA CAAAGGGCTA ATATCCAGAA 120 

TCTACAAAGA ACTTATACAA AHTACAAGA AACAAACAAA CAAACAACTC CTCAAAAAGT 180 

GGGTGAAGGA TGTGAACAGA CACTTCTCAA AAGAA6ACAT TTATGGGGCC AACAAACATA 240 

TGAAAAAAAG CTCATCATCA CTGGTCACTA GATAAATGCA AATCAAAACC ACAATGAGAT 300 

ACCATCTCAT TCCAGTTAGA ATGGCAATCA TTAAAAAGTC AGGAAACAAC AGATGCTGGA 360 



CAAGGTGTC 

(2) INFORMATION FOR SEQ ID NO: 192: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 449 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



369 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 192: 
TGACGCTTGG CCACTTGACA CTTCATCTTT GCACAGAAAA ACTTCTTTAC AGAHTAATT 60 
CAAGACTGGT CTAGTGACAG TCCTCCAGAC ATTTTTTCAT TTGTTCCATA TACGTGGAAT 120 
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TTTAAAATCA TGITTCATCA GTTTGAMTG ATTT66GCTG CTMTCAACA CAATTGGATC 180 

GACTGTTCTA CTAAACAACA GGAAAATGTG TATCT6GCAG CCTGTGGAGA AACACTAAAC 240 

ATTGATTTTT CTTTGCCTTT TACGGACTTT GTTCCAGCTA CATGTAATAC CAAGTTCTCT 300 

TTAAGAGGAG AAGATGTTGA TCTTCATTTG TTTCTACCAG ACTGCCACCC TAGTAAATAT 360 

TCTTTATTTA TGCTGGTAAA AAATTGCCAT CCAAATAAGA TGATTCATGA TACTGGTATT 420 

CCTGCTGAGT GTCAAGTGGC CAAGCGTCA 449 
(2) INFORMATION FOR SEQ ID N0:193: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 372 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(0) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:193: 

TGACGCTTGG CCACTTGACA CCAGGGATGT AKCAGTTGAA TATAATCCTG CAATTGTACA 60 

TATTGGCAAT TTCCCATCAA ACATTCTAGA AAGAGACAAC CAGGATTGCT AGGCCATAAA 120 

AGCTGCAATA AATAACTGGT AATT6CA6TA ATCATTTCA6 GCCAATTCAA TCCAGTTTG6 180 

CTCAGAGGTG CCTTTGGCTG AGAGAAGAGG TGAGATATAA TGTGTTTTCT TGCAACTTCT 240 

TGGAA6AATA ACTCCACAAT AGTCT6AGGA CTAGATACAA ACaATTTGC CATTAAAGCA 300 

CCAGAGTCTG TTAATTCCAG TACTGATAAG TGTT6GAGAT TAGACTCCAG TGTGTCAAGT 360 

GGCCAAGCGT CA 372 
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(2) INFORMATION FOR SEQ ID NO: 194: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 309 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(x1) SEQUENCE DESCRIPTION: SEQ ID NO: 194: 

TGACGCTTGG CCACTTGACA CTTATGTAGA ATCCATCGT6 GGCTGATGCA AGCCCTTTAT 60 

TTAGGCTTAG TGTTGTGG6C ACCTTCAATA TCACACTA6A GACAAAC6CC ACAAGATCTG 120 

CAGAAACATT CAGTTCTGAN CACTCGAATG GCAGGATAAC riTTTGTGTT GTAATCCHC 180 

ACATATACAA AAACAAACTC T6CANTCTCA CGTTACAAAA AAACGTACTG CTGTAAAATA 240 

TTAAGAAGGG GTAAAGGATA CCATCTATAA CAAAGTAACT TACAACTAGT GTCAAGTGGC 300 



CAAGCGTCA 

(2) INFORMATION FOR SEQ ID NO: 195; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 312 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



309 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 195: 



TGACGCTTGG CCACTTGACA CCCAATCTCG CACTTCATCC TCCCAGCACC TGATGAAGTA 



60 
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GGACTGCAAC TATCCCCACT TCCCAGATGA GGGGACCAAN GTACACATTA GGACCCGGAT 120 

GGGAGCACAG ATTTGTCCGA TCCCAGACTC CAAGCACTCA GCGTCACTCC AGGACAGCGG 180 

CTTTCAGATA AGGTCACAAA CATGAATGGC TCCGACAACC GGAGTCAGTC CGTGCTGA6T 240 

TAAGGCAATG GTGACACGGA TGCACGTGTN ACCTGTAATG GTTCATCGTA AGTGTCAAGT 300 

GGCCAAGCGT CA 312 
(2) INFORMATION FOR SEQ ID N0:196: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 288 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEONESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:196: 

TGTATCGACG TAGTGGTCTC CTCAGCCATG CAGAACTGTG ACTCAATTAA ACCTCTTTCC 60 

TTTAT6AATT ACCCAATCTC GGGTAGTGTC TTTATAGTAG TGT6AGAATG GACTAATACA 120 

AGTACATTTT ACTTAGTAAT AATAATAAAC AAATATATTA CATHTTGTG TATTTACTAC 180 

ACCATATTTT TTATTGTTAT TGTAGTGTAC ACCTTCTACT TATTAAAAGA AATAGGCCCG 240 

AG6CGGGCAG ATCACGAGGT CAGGAGATGG AGACCACTAC GTCGATAC 288 
(2) INFORMATION FOR SEQ ID NO: 197: 



(1) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 289 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY; linear 

(xi) SEQUENCE DESCRIPTION; SEO ID NO:197: 

TTGGGCACCT TCAATATCAT GACAGGTGAT GTGATAACCA AGAAGGCTAC TAAGTGAnA 60 

ATGGGTGGGT AATGTATACA GAGTAGGTAC ACTGGACAGA GGGGTAATTC ATAGCCAAGG 120 

CAGGA6AAGC AGAATGGCAA AACATTTCAT CACACTACTC AG6ATAGCAT GCAGTHAAA 180 

ACCTATAAGT AGTTTATTH TGGAATTTTC CACTTAATAT THCAGACTG CAGGTAACTA 240 

AACTGTGGAA CACAAGAACA TAGATAAGGG GA6ACCACTA CGTCGATAC 289 
(2) INFORMATION FOR SEQ ID NO: 198: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 288 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:198: 

GTATCGACGT AGTGGTCTCC CAAGCAGTGG GAAGAAAACG TGAACCAATT AAAATGTATC 60 

AGATACCCCA AAGAAAGGCG CTTGAGTAAA GATTCCAAGT GGGTCACAAT CTCAGATCTT 120 

AAAATTCAGG CTGTCAAAGA GATHGCTAT GAGGTTGCTC TCAATGACTT CAGGCACAGT 180 

CG6CAGGAGA TT6AAGCCCT GGCCAHGTC AAGATGAAG6 AGCHTGTGC CATGTATGGC 240 
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AAGAAAGACC CCAATGAGCG GGACTCCTGG AGACCACTAC GTCGATAC 



288 



(2) INFORMATION FOR SEQ ID NO: 199: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1027 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:199: 



6CTTTTTGGG AAAAACNCAA NTG6GGGAAA GGGG6NTTNN TNGCAAGGGG ATAAAGGGGG 60 



AANCCCAGGG TTTCCCCATT CAGGGA6GTG TAAAAAGNCG GCCAGGGGAT TGTAANAGGA 120 



TTCAATAATA GGGGGAATGG GCCCNGAAGT TGCAAGGHC CNGCCCGCCA TGNCCGCGGG 180 



ATTTAGTGAC ATTACGACGS TGGTAATAAA GTGGGSCCAA WAAATATTTG TGATGTGATT 240 



TTTSGACCAG TGAACCCATT GWACAGGACC TCATTTCCTY TGAGATGRTA GCCATAATCA 300 



GATAAAA6RT TAGAAGTYTT TCTGCACGTT AACAGCATCA TTAAATGGAG TGGCATCACC 360 



AATTTCACCC TTTGTTAGCC GATACCTTCC CCTTGAAG6C ATTCAATTAA GTGACCAATC 420 



GTCATACGAG AGGGGATGGC ATGGGGATTG ATGATGATAT CAGGGGTGAT ACCTTCACAG 480 



GTGAAAGGCA TATCCTCTTG TCTATACTGA ATACCACAAG TACCCTHTG ACCATGTCGA 540 



CTAGCAAATT TGTCTCCAAT CTGTGTWATC CCTAACAGAG CGTACCCTTA TTHACAAAA 600 



TTTATATCCT TCCTGATTGA GAGTTACCAT AACCTGATCC ACAATGCCC6 TCTCGCTWGT 660 



TCTGAGAAAA GTGCTACAGT CTCTCTTGGT ATAGCGTCTA TTGGTGCTCT CCAATTCATC 720 
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TTCATTTTTC AGGCAAGGT6 AACTGTTTTG CCTATAATAA CHTCATCTCC TGATACMC6A 780 
AACCCCK6GA RCTATCAAAC CATCATCATC CAGC6TTCKT WATGTYMCTA AATCCCTATT 840 
GCGGCCGCCT GCAGGTCAAC ATATNGGAAA ACCCCCCACC CCTTNGGAQC NTACCHGAA 900 
TTTTCCATAT GTCCCNTAAA HANCTNGNC TTANCCTGGC CNTAACCTNT TCCGGTrTAA 960 

AnCTTTCCG CCCCCNTTCC CCNCCHNNA ACCGGAAACC TTAATTTTNA ACCNGQGGTT 1020 

CCTATCC 1027 
(2) INFORMATION FOR SEQ ID NO: 200: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 207 base pairs 

(B) TYPE: nucleic add 

(C) STRANDEONESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:200: 

AGTGACATTA CGACGCTGGC CATCTTGAAT CCTAGGGCAT GAAGTTGCCC CAAAGTTCAG 60 

CACTTGGTTA AGCCTGATCC CTCTGGTTTA TCACAAAGAA TAGGATGGGA TAAAGAAAGT 120 

GGACACHAA ATAAGCTATA AATTATATGG TCCHGTCTA GCAGGAGACA ACTGCACA6G 180 

TATACTACCA GC6TCGTAAT GTCACTA 207 

(2) INFORMATION FOR SEQ ID N0:201: 

(1) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 209 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(0) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEO ID NO: 201: 

TGGGCACCn CAATATCTAT TAAAAGCACA AATACT6AA6 AACACACCAA GACTATCAAT 60 

GAG6TTACAT CTGGAGTCCT CGATATATCA G6AAAAAAT6 AA6TGAACAT TCACAGAGTT 120 

TTACnCTTT GGGAACTCAA ATGCTAGAAA A6AAAAGGGT GCCCTCTnC TCTGGCTTCC 180 



T6GTCCTATC CAGCGTCGTA ATGTCACTA 
(2) INFORMATION FOR SEO ID N0;202: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 349 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: linear 



209 



(x1) SEQUENCE DESCRIPTION: SEQ ID N0:202: 

NTACGCTGCA ACACTGTGGA GCCACTG6TT HTAHCCCG GCAGGTTATC CA6CAAACAG 60 

TCACTGAACA CACCGAAGAC CGTGGTAT6G TAACCGTTCA CA6TAATCGT TCCA6TCGTC 120 

TGCGGGACCC CGACGAGCGT CACTGGGTAC AGACCAGATT CAGCCGGAAG AGAAAGCGCC 180 

6CAGG6A6AG ACTC6AACTC CACTCCGCTG 6TGAGCAGCC CCATGTHTC AACTCGAA6T 240 

TCAAACGGCA nGGGTTATA TACCATCAGC T6AACTTCAC ACACATCTCC TTGAACCCAC 300 
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T6GAAATCTA TTTTCTTGTT CCGCTCTTCT CCACAGTGTT GCAGCGTAA 349 
(2) INFORMATION FOR SEQ ID NO: 203: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 241 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:203: 

TGCTCCTCTT GCCTTACCAA CCCAAAGCCC ACTGTGAAAT ATGAAGTGAA TGACAAAATT 60 

CAGTTTTCAA CGCAATATA6 TATAGTTTAT CT6ATTCTTT TGATCTCCA6 GACACTTTAA 120 

ACAACTGCTA CCACCACCAC CAACCTAGGG ATTTA66ATT CTCCACAGAC CAGAAAHAT 180 

TTCTCCTHG AGHTCAGGC TCCTCTGGGA CTCCT6TTCA TCAATGGGTG GTAAATGGCT 240 

A 241 
(2) INFORMATION FOR SEQ ID N0:204: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 248 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:204: 
TAGCCATHA CCACCCATCT GCAAACCSWG ACMWWCARGR CYWGWACKYA GGCGAHTGA 60 
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AGTACTGGTA ATGCTCTGAT CATGTTAGTT ACATAAGT6T GGTCAGTTTA CAAAAATTCA 120 



CAGAACTAAA TACTCAAT6C TATGTGTTCA TGTCTGTGTT TATGTGTGTG TAATGHTCA 180 



ATTAAGTTTT TTTAAAAAAA AGAGATGATT TCCAAATAAG AAAGCCGTGT TGGTAAGGCA 240 



AGAGGAGC 



248 



(2) INFORMATION FOR SEO ID N0:205: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 505 base pairs 

(B) TYPE: nucleic add 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:205: 



TACGCTGCAA CACTGTGGAG CCATTCATAC AGGTCCCTAA TTAAGGAACA AGTGATTATG 60 



CTACCTHGC ACGGTTAGGG TACCGCGGCC GHAAACATG TGTCACTGGG CAGGCGGTGC 120 



CTCTAATACT GGTGATGCTA GAGGTGATGT TTTTGGTAAA CAGGCGGGGT AAGATTTGCC 180 



GAGTTCCTTT TACnFFTTT AACCTTTCCT TATGA6CATG CCTGTGTTGG GTTGACAGTG 240 



GGGGTAATAA TGACTTGTTG GTTGATTGTA GATAHGGGC TGTTAATTGT CAGTTCAGT6 300 



TTTTAATCT6 ACGCAGGCTT ATGCGGAGGA GAATGTTTTC ATGTTACTTA TACTAACATT 360 



AGTTCTTCTA TAGGGTGATA GAHGGTCCA ATTGGGTGTG AGGAGTTCAG TTATATGTTT 420 



GGGATTTTTT AGGTAGTGGG TGTTGANCTT GAACGCTHC TTAATTGGTG GCTGCTTTTA 480 



RGCCTACTAT GGGTGGTAAA TGGCT 



505 
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(2) INFORMATION FOR SEQ ID N0:206: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 179 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:206: 
TA6ACTGACT CATGTCCCCT ACCAAAGCCC ATGTAAGGAG CTGAGTTCTT AAAGACT6AA 60 
GACA6ACTAT TCTCTGGAGA AAAATAAAAT GGAAATTGTA CTTTAAAAAA AAAAAAAATC 120 
GGCCGGGCAT GGTAGCACAC ACCT6TAATC CCAGCTACTA GGGGACAT6A GTCA6TCTA 179 
(2) INFORMATION FOR SEQ ID NO: 207: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 176 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:207: 
A6ACTGACTC ATGTCCCCTA CCCCACCTTC TGCTGT6CTG CCGTGHCCT AACAGGTCAC 60 
AGACTGGTAC TGGTCAGTGG CCTGGGGGTT GGGGACCTCT ATTATATGGG ATACAAATTT 120 
AGGAGTTGGA ATTGACACGA TTTAGTGACT GATGGGATAT GG6TGGTAAA TGGCTA 176 
(2) INFORMATION FOR SEQ ID N0:208: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 196 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:208: 

AGACT6ACTC ATGTCCCCTA TTTAACAGGG TCTCTAGTGC TGTGAAAAAA AAAAATGCTG 60 

AACATTGCAT ATAACTTATA TTGTAAGAAA TACTGTACAA TGACTTTATT GCATCTGGGl 120 

AGCTGTAAGG CATGAAGGAT GCCAAGAAGT TTAAGGAATA TG6GTGGTAA ATGGCTAGGG 180 



GACAT6AGTC AGTCTA 
(2) INFORMATION FOR SEQ ID N0:209: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 345 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



196 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:209: 

GACGCTTGGC CACTTGACAC CTTTTATTTT HAAGGAHC TTAAGTCAH TANGTNACTT 60 

TGTAAGTHT TCCTGTGCCC CCATAAGAAT GATAGCHTA AAAATTATGC TGGGGTAGCA 120 

AAGAAGATAC TTCTA6CTTT AGAATGTGTA GGTATAGCCA GGATTCTTGT GAGGAGGGGT 180 

GATTTAGAGC AAATTTCTTA TTCTCCTTGC CTCATCTGTA ACATGG6GAT AATAATAGAA 240 
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CTGGCTTGAC AA6GTTGGAA TTAGTATTAC ATGGTAAATA CATGTAAAAT GTTTAGAATG 300 
GTGCCAAGTA TCTAGGAAGT ACTTGGGCAT GGGTGGTAAA TGGCT 345 
(2) INFORMATION FOR SEQ ID N0:210: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 178 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:210: 
GACGCTTGGC CACTTGACAC TAGAGTAGGG THGGCCAAC TTTTTCTATA AAGGACCAGA 60 
GAGTAAATAT TTCA6GCTTT GT6GGTTGTG CAGTCTCTCT TGCAACTACT CAGCTCTGCC 120 
AHGTAGCAT AGAAATCAGC CATAGACAGG ACAGAAATGA ATGGGTGGTA AATGGCTA 178 
(2) INFORMATION FOR SEQ ID N0:2n: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 454 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: linear 



(x1) SEQUENCE DESCRIPTION: SEQ ID N0:211: 
TGGGCACCTT CAATATCTAT CCAGCGCATC TAAATTCGCT HTTTCTTGA TTAAAAATTT 60 
CACCACHGC TGTTTHGCT CATGTATACC AAGTAGCAGT GGTGTGA6GC CATGCTTGTT 120 
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TTTTGATTCG ATATCAGCAC CGTATAAGAG CAGTGCTTTG GCCATTAAH TATCTTCATT 180 

GTAGACAGCA TA6TGTAGAG TGGTATCTCC ATACTCATCT GGAATATTTG GATCAGTGCC 240 

ATGTTCCAGC AACATTAACG CACATTCATC TTCCTGGCAT TGTACGGCCT TTGTCAGAGC 300 

TGTCCTCTTT TTGTTGTCAA GGACATTAAG HGACATCGT CTGTCCAGCA C6AGTTTTAC 360 

TACTTCT6AA TTCCCATTGG CAGAGGCCAG ATGTAGAGCA GTCCTCnTT GCTTGTCCCT 420 

CTTGTTCACA TCAGTGTCCC TGAGCATAAC GGAA 454 
(2) INFORMATION FOR SEQ ID NO:212: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 337 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ 10 NO:212: 

TCC6TTATGC CACCCAGAAA ACCTACTGGA GTTACTTATT AACATCAAGG CTGGAACCTA 50 

THGCCTCAG TCCTATCTGA TTCATGAGCA CATGGTTATT ACTGATCGCA TTGAAAACAT. 120 

TGATCACCTG GGTTTCTTTA TTTATCGACT GTGTCATGAC AAGGAAACTT ACAAACTGCA 180 

ACGCAGAGAA ACTATTAAAG GTATTCAGAA ACGTGAAGCC AGCAATTGTT TC6CAATTCG 240 

GCATTTTGAA AACAAATTTG CCGTGGAAAC TTTAATTTGT TCTTGAACAG TCAAGAAAAA 300 

CATTATTGAG GAAAATTAAT ATCACAGCAT AACGGAA 337 
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(2) INFORMATION FOR SEQ ID N0:213: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 715 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEO ID N0:213: 



TCGGGTGATG CCTCCTCAGG CATCTTCCAT CCATCTCTTC AAGATTAGCT GTCCCAAATG 60 



TTTTTCCTTC TCTTCTTTAC TGATAAATTT GGACTCCTTC TTGACACTGA TGACAGCTTT 120 



AGTATCCTTC TTGTCACCTT GCAGACTTTA AACATAAAAA TACTCATTGG TTTTAAAAGG 180 



AAAAAAGTAT ACATTAGCAC TATTAAGCTT GGCCTT6AAA CATHTCTAT CTTTTAHAA 240 



ATGTCGGTTA GCTGAACAGA ATTCATTTTA CAATGCA6AG TGAGAAAAGA AGGGA6CTAT 300 



ATGCATTTGA GAATGCAAGC ATTGTCAAAT AAACATTTTA AATGCTTTCT TAAAGTGAGC 360 



ACATACAGAA ATACATTAAG ATATTAGAAA GTGTTTTTGC TTGTGTACTA CTAATTAGGG 420 



AAGCACCTTG TATAGTTCCT CTTCTAAAAT TGAAGTAGAT TTTAAAAACC CATGTAATTT 480 



AAHGAGCTC TCAGTTCAGA TTTTAG6AGA ATTTTAACAG GGATTTGGTT TTGTCTAAAT 540 



TTTGTCAATT TNTTTAGTTA ATCTGTATAA TTTTATAAAT GTCAAACTGT ATTTAGTCCG 600 



TTnCATGCT GCTATGAAAG AAATACCCAN 6ACAGG6TTA TTTATAAANG GAAAGANGTT 660 



AAHTGACTC CCAGTTCACA GGCCTGAGGA NGNATCNCCC GAAATCCTTA TTGCG 



715 



(2) INFORMATION FOR SEQ ID N0:214: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 345 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:214: 

GGTAANGNGC ATACNTCGGT GCTCCGGCCG CCGGAGTCGG GGGATTCGGG TGATGCCTCC 60 

TCAGGCCCAC TTGGGCCTGC TTTTCCCAAA TGGCAGCTCC TCTGGACATG CCATTCCHC 120 

TCCCACCTGC CTGATTCTTC ATATGTTGGG TGTCCCTGTT TTTCTGGTGC TATTTCCTGA 180 

CTGCTGTTCA GCTGCCACTG TCCTGCAAAG CCTGCCTTTT TAAATGCCTC ACCATTCCH 240 

CAHTGTHC TTAAATATGG GAAGTGAAAG TGCCACCTGA GGCCGG6CAC AGT6GCTCAC 300 

6CCTGTAATC CCAGCACTTT GGGAGCCTGA GGAGGCATCA CCCGA 345 
(2) INFORMATION FOR SEQ ID NO:215: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 429 base pairs 

(B) TYPE: nucleic add 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:215: 
GGTGATGCCT CCTCAGGCGA A6CTCAGGGA GGACAGAAAC CTCCCGTGGA GCA6AAGGGC 60 



AAAAGCTCGC TTGATCTTGA TTTTCAGTAC GAATACAGAC CGTGAAAGC6 GGGCCTCAC6 



120 
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ATCCTTCTGA CCTHTGOGT TTTAAGCAGG AGGTGTCAGA AAAGTTACCA CAGGGATAAC 180 

TGGCTTGTGG CGGCCAAGCG TTCATAGCGA CGTCGCITTT TGATCCTTCG ATGTCGGCTC 240 

TTCCTATCAT TGTGAAGCAG AATTCACCAA GCGnGGATT GnCACCCAC TAATAGGGAA 300 

CGTGAGCTGG 6TTTAGACCG TCGTGAGACA GGITAGTHT ACCCTACTGA TGAT6TGTKG 360 

TTGCCATGGT AATCCTGCTC AGTACGAGAG GAACCGCAGG TTCASACATT TGGTGTATGT 420 

GCTTGCCTT 439 
(2) INFORMATION FOR SEO ID N0:215: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 593 base pairs 

(B) TYPE: nucleic acid . 

(C) STRANOEONESS: single 
(0) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:216: 

TGACACCTAT GTCCNGCATC TGHCACAGT TTCCACAAAT AGCCAGCCTT TGGCCACCTC 60 

TCTGTCCTGA GGTATACAAG TATATCAGGA GGTGTATACC HCTCTTCTC TTCCCCACCA 120 

AAGAGAACAT GCAGGCTCTG GAAGCTGTCT TAGGAGCCH TGGGCTCAGA ATTTCAGAGT 180 

CTTGGGTACC nCGATGTGG TCTGGAAGGA GAAACAHGG CTQGGATAA GGAGTACAGC 240 

CGGAGGAGGG TCACAGAGCC CTCAGCTCAA GCCCCTGTGC CTTAGTCTAA AAGCAGCHT 300 

GGATGAG6AA GCAGGTTAAG TAACATACGT AAGCGTACAC AG6TAGAAAG TGCTGG6AGT 360 
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CAGAATTGCA CAGTGTGTAG GAGTAGTACC TCAATCAATG AGGGCAAATC AACTGAAAGA 420 

AGAAGACCNA TTAATGAATT GCTTANGGGG AAGGATCAAG GCTATCATGG AGATCHTCT 480 

AGGAAGATTA TTGTTTANAA TTATGAAAGG ANTA6GGCAG GGACA6GGCC AGAAGTANAA 540 

GANAACATTG CCTATANCCC TTGTCTTGCA CCCAGATGCT GGACAAGGTG TCA 593 
(2) INFORMATION FOR SEO ID N0:217: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 335 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDMESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:217: 

TGACACCTTG TCCAGCATCT GACGTGAAGA TGAGCAGCTC AGAGGAGGTG TCCTGGATTT 60 

CCTGGTTCTG TGGGCTCCGT GGCAATGAAT TCTTCTGTGA AGTGGATGAA GACTACATCC 120 

AGGACAAATT TAATCTTACT GGACTCAAT6 AGCAGGTCCC TCACTATCGA CAAGCTCTAG 180 

ACATGATCTT G6ACCTGGAG CCTGATGAAG AACTGGAAGA CAACCCCAAC CAGAGTGACC 240 

TGATTGAGCA GGCAGCCGAG ATGCTTTATG GATT6ATCCA CGCCCGCTAC ATCCTTACCA 300 

ACCGTGGCAT CGCCCAGATG CTGGACAAGG TGTCA 335 
(2) INFORMATION FOR SEQ ID NO: 218: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 248 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:218: 

TACGTACTGG TCTTGAAGGT CHAGGTAGA GAAAAAATGT GAATATHAA TCAAAGACTA 60 

TGTATGAAAT 6G6ACTGTAA GTACAGAGGG AAGGGTGGCC CTTATCGCCA 6AAGTTGGTA 120 

GATGCGTCCC CGTCATGAAA T6TT6TGTCA CT6CCCGACA TTTGCCGAAT TACTGAAATT 180 

CCGTAGAATT AGTGCAAAH CTAACGTTGT TCATCTAAGA TTATG6TTCC ATGTTTCTA6 240 

TACTTTTA 248 
(2) INFORMATION FOR SEQ ID N0:219: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 530 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:219: 

TGACGCTTGG CCACHGACA CAAGTAG66G ATAAGGACAA AGACCCATNA G6TGGCCTGT 60 

CAGCCTTTTG TTACTGHGC TTCCCTGTCA CCACGGCCCC CTCTGTAGGG GTGTGCTGTG 120 

CTCTGTGGAC ATTGGTGCAT TITCACACAT ACCAHCTCT TTCTGCTTCA CAGCAGTCCT 180 

6AGGCGGGAG CACACAGGAC TACCTT6TCA GATGANGATA ATGATGTCTG GCCAACTCAC 240 

CCCCCAACCT TCTCACTAGT TATANGAAGA GCCANGCCTA NAACCTTCTA TCCTGNCCCC 300 
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TTGCCCTATG ACCTCATCCC TGTTCCATGC CCTATTCTGA TTTCTGGTGA ACTTTGGAGC 360 

AGCCTGGTTT NTCCTCCTCA CTCCAGCCTC TCTCCATACC ATGGTANGGG GGTGCTGTTC 420 

CACNCAAANG GTCAGGTGTG TCTGGGGAAT CCTNANANCT GCCNGGAGTT TCCNAN6CAT 480 

TCTTAAAAAC CTTCTTGCCT AATCANATNG T6TCCAGTGG CCAACCNTCN 530 
(2) INFORMATION FOR SEO ID N0:220: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 531 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:220: 

TGACGCHGG CCACTTGACA CTAAATAGCA TCTTCTAAA6 GCCT6ATTCA GAGTTGTGGA 60 

AAATTCTCCC AGT6TCAGGG ATTGTCAGGA ACAGGGCTGC TCCTGTGCTC ACTTTACCTG 120 

CTGTGHTCT GCTGGAAAAG GAGGGAAGAG GAATGGCTGA TTTTTACCTA ATGTCTCCCA 180 

GTTTTTCATA TTCTTCTTGG ATCCTCTTCT CTGACAACTG TTCCCTTTTG GTCTTCTTCT 240 

TCTTGCTCAG AGAGCAGGTC TCTTTAAAAC TGAGAAGGGA GAATGAGCAA ATGATTAAAG 300 

AAAACACACT TCTGAGGCCC AGAGATCAAA TATTAQGTAA ATACTAAACC GCTTGCCTGC 360 

TGTGGTCACT TTTCTCCTCT TTCACATGCT CTATCCCTCT ATCCCCCACC TAHCATATG 420 

GCTnTATCT GCCAAGHAT CCGGCCTCTC ATCAACCTTC TCCCCTAGCC TACTGGGGGA 480 
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TATCCATCTG GGTCTGTCTC TGGTGTATTG GTGTCAAGT6 GCCAAGCGTC A 53i 

(2) INFORMATION FOR SEQ ID N0:221: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 530 base pairs 
(8) TYPE: nucleic acid 

(C) STRANOEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:221: 

AHGACGCTT GGCCACTTGA CACCC6CCTG CCTGCAATAC TGGGGCAAGG GCCTTCACTG 60 

CTTTCCTGCC ACCAGCTGCC ACTGCACACA GAGATCAGAA ATGCTACCAA CCAAGACTGT 120 

TGGTCCTCAG CCTCTCTGAG GAGAAAGAGC AGAAGCCTGG AAGTCAGAAG AGAAGCTA6A 180 

TCG6CTAC6G CCTTGGCAGC CAGCTTCCCC ACCTGTGGCA ATAAAGTCGT 6CATG6CTTA 240 

ACAATGGGGG CACCTCCTGA GAAACACATT GHAGGCAAT TCGGCGTGTG TTCATCAGAG 300 

CATATTTACA CAAACCTCGA TAGTGCAGCC TACTATCCAC TATTGCTCCT ACGCTGCAAA 360 

CCTGAACAGC ATGGGACTGT ACTGAATACT GGAAGCAGCT GGTGAT6GTA CTTATTT6TG 420 

TATCTAAACA CAGAGAAG6T ACAGTAAGAA TATGGTATCA TAAACTTACA GGGACCGCCA 480 

TCCTATAT6C AGTCTGHGT GACCAAAAT6 TGTCAA6TGG CCAAGCGTCA 530 
(2) INFORMATION FOR SEQ ID N0:222: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 578 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS; single 
(0) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ 10 NO:222: 

TGTATCGACG TAGTGGTCTC CGGGCTACTA GGCCGHGTG TGCTGGTAGT ACCTGGTTCA 60 

CTGAAA6GCG CATCTCCCTC CCCGCGTCGC CCTGAAGCAG GGGGAGGACT TCGCCCAGCC 120 

AAGGCAGTTG TATGAGTTTT AGCTGCGGCA CTTCGAGACC TCT6AGCCCA CCTCCTTCAG 180 

GAGCCTTCCC CGATTAAGGA AQCCAGGGTA AGGATTCCTT CCTCCCCCAG ACACCACGAA 240 

CAAACCACCA CCCCCCCTAT TCTG6CAGCC CATATACATC AGAACGAAAC AAAAATAACA 300 

AATAAACNAA AACCAAAAAA AAAAGA6AAG 6GGAAATGTA TATGTCT6TC CATCCTGTT6 360 

CTTTAGCCTG TCAGCTCCTA NAGGGCAGGG ACCGTGTCTT CCGAATGGTC TGTGCAGCGC 420 

CGACTGCGGG AAGTATC6GA G6AGGAAGCA GAGTCAGCAG AAGTTGAACG GTGG6CCCGG 480 

CGGCTCTT6G GGGCTQGTGT TGTACTTCGA GACCGCTTTC GCTTTTTGTC TTAGATTTAC 540 

GTTTGCTCTT TGGAGTGGGA NACCACTACN TCNATACA 578 

(2) INFORMATION FOR SEQ ID N0:223: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 578 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:223: 
TGTATCGACG TAGTGGTCTC CTCHGCAAA GGACTGGCTG GTGAATGGH TCCCTGAATT 60 
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ATGGACTTAC CCTAAACATA TCTTATCATC ATTACCAGTT GCAAAATATT AGAATGT6TT 120 

6TCACTGTTT CAmGAHC CTAGAAGGH AGTCTTAGAT ATGTTACTTT AACCT6TATG 180 

CTGTAGTGCT TTGAATGCAT TTTTTGTTT6 CATTTTTGn TGCCCAACCT GTCAATTATA 240 

GCTGCTTAGG TCTGGACT6T CCTGGATAAA GCTGTTAAAA TAHCACCAG TCCAGCCATC 300 

TTACAA6CTA ATTAAGTCAA CTAAATGCTT CCTTGTTTTG CCA6ACTTGT TAT6TCAATC 360 

CTCAAHTCT GGGTTCATTT TGGGTGCCCT AAATCHAGG GTGTGACTTT CTTA6CATCC 420 

TGTAACATCC ATTCCCAAGC AAGCACAACT TCACATAATA CTTTCCAGAA GTTCATTGCT 480 

GAAGCCTTTC CTTCACCCAG CGGAGCAACT T6ATTTTCTA CAACTTCCCT CATCAGAGCC 540 

ACAAGAGTAT GGGATATOSA GACCACTACG TCGATACA 578 
(2) INFORMATION FOR SEQ ID N0:224: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 345 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID ND:224: 

T6TATCG/C6 TANTGGTCTC CCAAG6TGCT GGGAHGCAG GCATGAGCCA CCACTCCCAG 60 

GTGGATCTTT TTCTTTATAC TTACTTCATT AGGTTTCTGT TAHCAAGAA GTGTA6TGGT 120 

AAAAGTCTTT TCAATCTACA TGGTTAAATA ATGATAGCCT GGGAAATAAA TAGAAATTTT 180 

TTCTTTCATC THAGGTTGA ATAAAGAAAC AGAAAAAATA GAACATACTG AAAATAATCT 240 
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AAGTTCCAAC CATAGAAGAA CTGCAGAAGA AATGAAGAAA GTGATGATGA TTTAGATTTT 300 
6ATATTGATT TAGAAGACAC AGGAGGAGAC CACTACGTCG ATACA 345 
(2) INFORMATION FOR SEO ID N0:225: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 347 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: Single 
(0) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEO ID N0:225: 

TGTATCGACG TAGTGGTCTC CAAACTGAGG TATGTGTGCC ACTAGCACAC AAAGCCTTCC 60 

AACAGGGACG CAGGCACAGG CAGTTTAAAG GGAATCTGH TCTAAATTAA HTCCACCTT 120 

CTCTAAGTAT TCTTTCCTAA AACTGATCAA GGTGTGAAGC CTGTGCTCH TCCCAACTCC 180 

CCTTTGACAA CAGCCTTCAA CTAACACAAG AAAAGGCAT6 TCTGACACTC TTCCT6AGTC 240 

T6ACTCTGAT ACGTTGTTCT GATGTCTAAA GAGCTCCAGA ACACCAAAGG 6ACAATTCAG 300 

AATGCTGGTG TATAACAGAC TCCAATGGAG ACCACTACGT CGATACA 347 

(2) INFORMATION FOR SEQ ID NO: 226: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 281 base pairs 

(B) TYPE: nucleic acid 

(C) STRAtCIEDNESS: Single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:226: 
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AGGNGNGGGA NTGTATCGAC GTAGTGGTCT CCCAACAGTC TGTCATTCAG TCTGCAGGTG 60 

TCAGTGTTTT GGACAATGAG GCACCAHGT CACTTAHGA CTCCTCAGCT CTAAATGCTG 120 

AAATTAAATC TTGTCATGAC AAGTCTQGAA HCCTGATGA GGTTTTACAA AGTATTTTG6 180 

ATCAATACTC CAACAAATCA GAAAGCCAGA AAGAGGATCC HTCAATATT GCAGAACCAC 240 

GAGTGGATTT ACACACCTCA G6AGACCACT ACGTC6ATAC A 281 
(2) INFORMATION FOR SEO ID N0:227: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3646 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(0) TOPOLOGY: linear 

(x1) SEQUENCE DESCRIPTION: SEQ ID NO:227: 

GGGAAACACT TCCTCCCAGC CTTGTAAGGG TTGGAGCCCT CTCCAGTATA TGCTGCAGAA 60 

TTTTTCTCTC G6TTTCTCAG AGGATTATG6 AGTCC6CCTT AAAAAA6GCA AGCTCTGGAC 120 

ACTCTGCAAA GTAGAATGGC CAAAGHTGG AGTTGA6TGG CCCCTTGAAG GGTCACTGAA 180 

CCTCACAATT GHCAAGCTG TGTGGCGGGT TGTTACT6AA ACTCCCGGCC TCCCT6ATCA 240 

GrrrCCCTAC ATTGATCAAT GGCTGAGTTT GGTCAGGAGC ACCCCHCCG TGGCTCCACT 300 

CAT6CACCAT TCATAATTTT ACCTCCAAGG TCCTCCTGAG CCAGACCGTG TTHCGCCTC 360 

GACCCTCAGC CGGTTCGGCT CGCCCTGTAC TGCCTCTCTC TGAAGAAGAG GAGAGTCTCC 420 

CTCACCCAGT CCCACCGCCT TAAAACCAGC CTACTCCCTT AGGGTCATCC CATGTCTCCT 480 
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CGGCTAT6TC CCCTGTAG6C TCATCACCCA 
AAGTAGCCCC TCTACTACCA CTGAGAGAGG 
CCCCCTTCCT GGTTTATGTC CCTTCmCT 
AATCCTCCCT TCTCTGAAAA GCCCCAGGCT 
ACACATTGGC CCACCTGGGA TGACTGTCAA 
GAGAGGGAAA GTATCCAAAG AGAGGCCAAA 
GAGGAGGAA6 CTA6AGGAAT AGTGATTAGA 
TTTCTCAAGT GGAGGGAGAA CTTTTGACGA 
GGGAGCTGCT CAGAAACCTA TAAACTTGTC 
TGAGTCACCA GGAGTGTTTT TAGAGCACCT 
TGACCTG6CA OICCCCGAAA ATAGCCATGC 
CCCAGATAGT AAAAGGAAAC TCCAAAAACT 
AGCTTTTAGA GATAGCCTAA AAGGTTTTTG 
CAGCTCAGGC AGCTGAAAAA AGCCACTGAT 
TAGATCAGCC TCATTTGACT TCCCCTCCCA 
CTGACTCAAA CTCCACTATT CCTGTTCATG 
CTGGCCGACC TGATCnCAA AATGTGCCCC 
ACAGTAGCAG CTTCCTCGAG AAGGGACTAC 
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TTGCCTCTTG GTTGCAACCG T6GTGGGAGG 
CACAAGTCCC TCTGGGTGAT GAGTGCTCCA 
ACTTCTGACT TGTATAATTG GAAAACCCAT 
TTGACCTCAC TGATGGAGTC TGTACTCTGG 
CAGCTCCTTT TGACCCUTT CACCTCTGAA 
AAGTACAACC TCACATCAAC CAATAGGCCG 
GACCCAATTG GGACCTAATT GGGACCCAAA 
TTTCCACCGG TATCTCCTCG TGGGTATTCA 
TAAGGCGACT GAAGTCGTCC AGGGGCATGA 
CCAGGAGGCT TATCAGATTT ACACCCCTTT 
TCTTAATTTG GCATTTGTGG CTCAGGCAGC 
AGAGGGATTT TGCTGGAATG AATACCAGTC 
ACAGTCAAGA GGTTGAAAAA CAAAAACAAG 
AAAGCATCCT GGAGTATCAG AGTTTACTGT 
CATGGTGTTT AAATCCAGCT ACACTACTTC 
ACTGTCAGGA ACTGTTQGAA ACTACTGAAA 
TAGGAAAGGT GGATGCCACC ATGTTCACAG 
GAAAGGCCGG TGCAGCTGTT ACCATGGA6A 
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CAGATGTGTT GTGGGCTCAG GCTTTACCAG CAAACACCTC AGCACAAAAG GCTGAATTGA 1620 
TCGCCCTCAC TCAGGCTCTC CGATGGGGTA AGGATATTAA CGTTAACACT GACAGCAGGT 1680 

AC6CCTTTGC TACTGTGCAT GTACGTGGAG CCATCTACCA GGAGCGTGGG CTACTCACCT 1740 

CAGCAGGTGG CTGTAATCCA CTGTAAAGGA CATCAAAAGG AAAACACGGC TGTTGCCCGT 1800 

GGTAACCAGA AA6CTGATTC AGCAGCTCAA GATGCAGTGT GACTTTCAGT CACGCCTCTA 1860 

AACTTGCTGC CCACAGTCTC CTTTCCACAG CCAGATCTGC CTGACAATCC CGCATACTCA 1920 

ACAGAAGAAG AAAACTGGCC TCAGAACTCA GAGCCAATAA AAATCAGGAA GGHGGTGGA 1980 

TTCTTCCTGA CTCTAGAATC TTCATACCCC GAACTCTTGG 6AAAACTTTA ATCAGTCACC 2040 

TACAGTCTAC CACCCAHTA GGAGGAGCAA AGCTACCTCA GCTCCTCCGG AGCCGTTnA 2100 

AGATCCCCCA TCTTCAAAGC CTAACA6ATC AAGCAGCTCT CCGGTGCACA ACCTGCGCCC 2160 

AGGTAAATGC CAAAAAAGGT CCTAAACCCA GCCCAGGCCA CCGTCTCCAA GAAAACTCAC 2220 

CAGGAGAAAA GTGGGAAATT GACTTTACAG AAGTAAAACC ACACCGGGCT GGGTACAAAT 2280 

ACCTTCTAGT ACTGGTAGAC ACCTTCTCTG GATGGACTGA AGCAinGCT ACCAAAAACG 2340 

AAACTGTCAA TATGGTAGH AAGTmiAC TCAAT6AAAT CATCCCTCGA CATGGGCTGC 2400 

CIGTHGCCA TAG66TCTGA TAATGGACCG GCCHCGCCT TGTCTATA6T TTAGTCAGTC 2460 

AGTAAGGCGT TAAACAHCA ATGGAAGCTC CATTGT6CCT ATCGACCCCA GAGCTCTGGG 2520 

CAAGTAGAAC GCATGAACTG CACCCTAAAA AACACTCTTA CAAAAHAAT CTTAGAAACC 2580 

GGTGTAAATT GTGTAAGTCT CCTTCCTTTA GCCCTACHA GAGTAAGGTG CACCCCTTAC 2640 



wo 97/25426 PCTA)S97/00485 

190 

TGGGCTGGGT TCTTACCHT TGAAATCATG TATGGGAGGG TGCTGCCTAT CTTGCCTAAG 2700 

CTAA6AGATG CCCAATTG6C AAAAATATCA CAAACTAAH TATTACAGTA CCTACAGTCT 2760 

CCCCAACAGG TACAAGATAT CATCCT6CCA CTTGHCGAG GAACCCATCC CAATCCAAH 2820 

CCTGAACAGA CAGGGCCCT6 CCATTCAHC CCGCCAGGTG ACCTGITGH TGTTAAAAAG 2880 

TTCCAGAGAG AAGGACTCCC TCCTGCnGG AAGAGACCTC ACACCGTCAT CACGAT6CCA 2940 

ACGGCTCTGA AGGTGGATGG CAHCCTGCG TGGATTCATC ACTCCCGCAT CAAAAAGGCC 3000 

AACAGAGCCC AACTAGAAAC ATGGGTCCCC AQGGCTGQGT CAGGCCCCTT AAAACTGCAC 3060 

CTAA6TTGGG TGAAGCCATT AGATTAATTC TTTTTCnAA THTGIAAAA CAATGCATAG 3120 

CTTCTGTCAA ACTTATGTAT CTTAAGACTC AATATAACCC CCTTGTTATA ACTGAGGAAT 3180 

CAAT6ATTTG ATTCCCCCAA AAACACAAGT GGGGAATGTA GTGTCCAACC TGGITTTTAC 3240 

TAACCCTGTT niAGACTCT CCCTnCCH TAATCACTCA GCTTGTTTCC ACCTGAAHG 3300 

ACTCTCCCTT AGCTAAGAGC GCCAGATGGA CTCCATCTTG GCTCmCAC TGGCAGCCGC 3360 

TTCCTCAAGG ACHAACTTG TGCAAGCTGA CTCCCAGCAC ATCCAAGAAT QCAAHAACT 3420 

GATAAGATAC TGTGGCAAGC TATATCCGCA GTTCCCAGGA ATTCGTCCAA TTGATCACAG 3480 

CCCCTCTACC CTTCAGCAAC CACCACCCT6 ATCAGTCA6C AGCCATCAGC ACC6A6GCAA 3540 

GGCCCTCCAC CAGCAAAAAG ATTCTGACTC ACTGAAGACT TGGATGATCA TTAGTAUTT 3600 

TAGCAGTAAA GITTnnn CTTTTTCTTT CIllllllCT CGTQCC 3646 
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Claims 

1 . An isolated DNA molecule, comprising: 

(a) a nucleotide sequence selected from the group consisting of SEQ ID 
N0:1, SEQ ID N0:3 - SEQ ID NO:77 and SEQ ID NOS:142, 143, 146-152, 154-166, 168- 
176, 178-192, 194-198, 200-204, 206, 207,209-214,216,218,219,221-227; 

(b) a variant of said nucleotide sequence that contains one or more 
nucleotide substitutions, deletions, insertions and/or modifications at no more than 20% of 
the nucleotide positions, such that the antigenic and/or immunogenic properties of the 
polypeptide encoded by the nucleotide sequence are retained; or 

(c) a nucleotide sequence encoding an epitope of a polypeptide encoded by 
at least one sequence selected from the group consisting of SEQ ID N0:1, SEQ ID N0:3 - 
SEQ IDNO:77 and SEQ IDNOS:142, 143, 146-152, 154-166, 168-176, 178-192, 194-198, 
200-204, 206, 207, 209-214, 216, 218, 219, 221-227. 

2. An isolated DNA molecule encoding an epitope of a polypeptide, 
wherein said polypeptide is encoded by a nucleotide sequence that: 

(a) hybridizes to a sequence selected from the group consisting of SEQ ID 
N0:1, SEQ ID NO:3 - SEQ ID NO:77 and SEQ ID NOS:142, 143, 146-152, 154-166, 168- 
176, 178-192, 194-198, 200-204, 206, 207, 209-214, 216, 218, 219, 221-227 under stringent 
conditions; and 

(b) is at least 80% identical to a sequence selected from the group 
consisting of SEQ ID N0:1, SEQ ID N0:3 - SEQ ID NO:77 and SEQ ID NOS:142, 143, 
146-152, 154-166, 168-176, 178-192, 194-198, 200-204, 206, 207, 209-214, 216, 218, 219, 
221-227; 

and wherein RNA corresponding to said nucleotide sequence is expressed at a greater level in 
human breast tumor tissue than in normal breast tissue. 



3. An isolated DNA molecule encoding an epitope of a polypeptide, 
wherein said polypeptide is encoded by: 
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(a) a nucleotide sequence transcribed from the sequence of SEQ ID NO: 

141; or 

(b) a variant of said nucleotide sequence that contains one or more 
nucleotide substitutions, deletions, insertions and/or modifications at no more than 20% of 
the nucleotide positions, such that the antigenic and/or immunogenic properties of the 
polypeptide encoded by the nucleotide sequence are retained. 

4. An isolated DNA or RNA molecule comprising a nucleotide sequence 
complementary to a DNA molecule according to any one of claims 1-3. 

5. A recombinant expression vector comprising a DNA molecule 
according to any one of claims 1-3. 

6. A host cell transformed or transfected with an expression vector 
according to claim S. 

7. A polypeptide comprising an amino acid sequence encoded by a DNA 
molecule according to any one of claims 1-3. 

8. A polypeptide according to claim 7 wherein said polypeptide 
comprises an epitope of an amino acid sequence encoded by at least one nucleotide sequence 
selected from the group consisting of SEQ ID N0:1, SEQ ID N0:3 - SEQ ID NO:77 and 
SEQ ID NOS:142, 143, 146-152, 154-166, 168-176, 178-192, 194-198, 200-204, 206, 207, 
209-214, 216, 218, 219, 221-227. 



9. A monoclonal antibody that binds to a polypeptide according to 

claim 7. 
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10. A method for determining the presence of breast cancer in a patient 
comprising detecting, within a biological sample, at least one polypeptide according to claim 
7, and therefrom determining the presence of breast cancer in the patient. 

11. A method for determining the presence of breast cancer in a patient 
comprising detecting within a biological sample, at least one polypeptide encoded by a 
nucleotide sequence selected from the group consisting of SEQ ID NO: 78-86 and SEQ ID 
NOS:144, 145, 153, 167, 177, 193, 199, 205, 208, 215, 217, 220, and sequences that 
hybridize thereto under stringent conditions. 

12. The method of claims 10 or 11 wherein the biological sample is a 
portion of a breast tumor. 

13. The method of claim 10 wherein the step of detecting comprises 
contacting the biological sample with a monoclonal antibody according to claim 9. 

14. The method of claim 11 wherein the step of detecting comprises 
contacting the biological sample with a monoclonal antibody that binds to a polypeptide 
encoded by a nucleotide sequence selected from the group consisting of SEQ ID NO: 78-86 
and SEQ ID NOS:144, 145, 153, 167, 177, 193, 199, 205, 208, 215, 217, 220, and sequences 
that hybridize thereto under stringent conditions. 

15. A method for determining the presence of breast cancer in a patient 
comprising detecting, within a biological sample, an RNA molecule encoding at least one 
polypeptide according to claim 7, and therefrom determining the presence of breast cancer in 
the patient. 



16. A method for determining the presence of breast cancer in a patient 
comprising detecting, within a biological sample, at least one RNA molecule encoding a 
polypeptide encoded by a nucleotide sequence selected from the group consisting of SEQ ID 
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NO: 78-86 and SEQ ID NOS:144, 145, 153, 167, 177, 193, 199, 205, 208, 215, 217, 220, and 
sequences that hybridize thereto under stringent conditions; and therefrom determining the 
presence of breast cancer in the patient. 

17. The method of claims 15 or 16 wherein the biological sample is a 
portion of a breast tumor. 

1 8. The method of claim 1 5 wherein the step of detecting comprises: 

(a) preparing cDNA from RNA molecules within the biological sample; 

and 

(b) specifically amplifying cDNA molecules that are capable of encoding 
at least a portion of a polypeptide according to claim 7, and therefrom determining the 
presence of breast cancer in the patient. 

19. The method of claim 16 wherein the step of detecting comprises: 

(a) preparing cDNA from RNA molecules within the biological sample; 

and 

(b) specifically amplifying cDNA molecules that are capable of encoding 
at least a portion of a polypeptide encoded by a nucleotide sequence selected fi-om the group 
consisting of SEQ ID NO: 78-86 and SEQ ID NOS:144, 145, 153, 167, 177, 193, 199, 205, 
208, 215, 217, 220, and sequences that hybridize thereto under stringent conditions; and 
therefrom determining the presence of breast cancer in the patient. 

20. A polypeptide according to claim 7 for use within a method for 
detecting the presence of breast cancer in a patient. 

21. A polypeptide encoded by a nucleotide sequence selected from the 
group consisting of SEQ ID NO: 78-86 and SEQ ID NOS:144, 145, 153, 167, 177, 193, 199, 
205, 208, 215, 217, 220, and sequences that hybridize thereto under stringent conditions, for 
use within a method for detecting the presence of breast cancer in a patient. 
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22. A method for monitoring the progression of breast cancer in a 
patient, comprising: 

(a) detecting an amount, in a biological sample, of at least one polypeptide 
according to claim 7 at a first point in time; 

(b) repeating step (a) at a subsequent point in time; and 

(c) comparing the amounts of polypeptide detected in steps (a) and (b), 
and therefrom monitoring the progression of breast cancer in the patient. 

23. A method for monitoring the progression of breast cancer in a 
patient, comprising: 

(a) detecting in a biological sample an amount of at least one polypeptide 
at a first point in time, the polypeptide being encoded by a nucleotide sequence selected from 
the group consisting of SEQ ID NO: 78-86 and SEQ ID NOS:144, 145, 153, 167, 177, 193, 
199, 205, 208, 215, 217, 220, and sequences that hybridize thereto under stringent conditions; 

(b) repeating step (a) at a subsequent point in time; and 

(c) comparing the amounts of polypeptide detected in steps (a) and (b), 
and therefrom monitoring the progression of breast cancer in the patient. 

24. The method of claims 22 or 23 wherein the biological sample is a 
portion of a breast tumor. 

25. The method of claim 22 wherein the step of detecting comprises 
contacting a portion of the biological sample with a monoclonal antibody according to 
claim 9. 

26. The method of claim 23 wherein the step of detecting comprises 
contacting the biological sample with a monoclonal antibody that binds to a polypeptide 
encoded by a nucleotide sequence selected from the group consisting of SEQ ID NO: 78-86 
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and SEQ ID NOS:144, 145, 153, 167, 177, 193, 199, 205, 208, 215, 217, 220, and sequences 
that hybridize thereto under stringent conditions. 

27. The method of either of claims 20 or 22 wherein said polypeptide 
comprises an epitope of an amino acid sequence encoded by at least one nucleotide sequence 
selected from the group consisting of SEQ ID NO:l, SEQ ID N0:3 - SEQ ID NO:77 and 
SEQ ID NOS:142, 143, 146-152, 154-166, 168-176, 178-192, 194-198, 200-204, 206, 207, 
209-214, 216, 218, 219, 221-227. 

28. A method for monitoring the progression of breast cancer in a 
patient, comprising: 

(a) detecting an amount, within a biological sample, of at least one RNA 
molecule encoding a polypeptide according to claim 7 at a first point in time; 

(b) repeating step (a) at a subsequent point in time; and 

(c) comparing the amounts of RNA molecules detected in steps (a) and 
(b), and therefrom monitoring the progression of breast cancer in the patient. 

29. The method of claim 28 wherein the step of detecting comprises: 

(a) preparing cDNA from RNA molecules within the biological sample; 

and 

(b) specifically amplifying cDNA molecules that are capable of encoding 
at least a portion of a polypeptide according to claim 7. 

30. A method for monitoring the progression of breast cancer in a 
patient, comprising: 

(a) detecting an amount, within a biological sample, of at least one RNA 
molecule at a first point in time, the RNA molecule encoding a polypeptide encoded by a 
nucleotide sequence selected from the group consisting of SEQ ID NO: 78-86 and SEQ ID 
NOS:144, 145, 153, 167, 177, 193, 199, 205, 208, 215, 217, 220, and sequences that 
hybridize thereto under stringent conditions; 
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(b) repeating step (a) at a subsequent point in time; and 

(c) comparing the amounts of RNA molecules detected in steps (a) and 
(b), and therefrom monitoring the progression of breast cancer in the patient. 

31. A pharmaceutical composition, comprising a polypeptide according to 
claim 7 and a physiologically acceptable carrier. 

32. A pharmaceutical composition for inhibiting the development of breast 
cancer, comprising a polypeptide and a physiologically acceptable carrier, the polypeptide 
being encoded by a nucleotide sequence selected from the group consisting of SEQ ID NO: 
78-86 and SEQ ID NOS:144, 145, 153, 167, 177, 193, 199, 205, 208, 215, 217, 220, and 
sequences that hybridize thereto under stringent conditions. 

33. A vaccine, comprising a polypeptide according to claim 7 and an 
inmiune response enhancer. 

34. A vaccine, comprising a DNA molecule according to any one of 

claims 1*3. 

35. A vaccine, comprising a recombinant expression vector comprising a 
DNA molecule according to any one of claims 1-3. 

36. A vaccine for inhibiting the development of breast cancer, comprising 
a polypeptide and an inunime response enhancer, the polypeptide being encoded by a 
nucleotide sequence selected from the group consisting of SEQ ID NO: 78-86 and SEQ ID 
NOS:144, 145, 153, 167, 177, 193, 199, 205, 208, 215, 217, 220, and sequences that 
hybridize thereto under stringent conditions. 

37. A diagnostic kit comprising: 

(a) one or more monoclonal antibodies according to claim 9; and 
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(b) a detection reagent. 

38. A diagnostic kit comprising: 

(a) one or more monoclonal antibodies that bind to a polypeptide encoded 
by a nucleotide sequence selected from the group consisting of sequences provided in SEQ 
ID 78-86 and SEQ ID NOS:144, 145, 153, 167, 177, 193, 199, 205, 208, 215, 217, 220; and 

(b) a detection reagent. 

39. The kit of any one of claims 37 or 38 wherein the monoclonal 
antibody(s) are immobilized on a solid support. 

40. A diagnostic kit comprising a first polymerase chain reaction primer 
and a second polymerase chain reaction primer, the first and second primers each comprising 
at least about 10 contiguous nucleotides of an RNA molecule according to claim 4. 

41. A diagnostic kit comprising a first polymerase chain reaction primer 
and a second polymerase chain reaction primer, the first and second primers each comprising 
at least about 10 contiguous nucleotides of an RNA molecule encoding a polypeptide 
encoded by a nucleotide sequence selected from the group consisting of SEQ ID NOS:78-86 
and SEQ IDNOS:144, 145, 153, 167, 177, 193, 199, 205, 208,215,217, 220. 

42. A diagnostic kit comprising at least one oligonucleotide probe, the 
oligonucleotide probe containing at least about 15 contiguous nucleotides of a DNA molecule 
according to claim 4. 

43. A diagnostic kit comprising at least one oligonucleotide probe, the 
oligonucleotide probe comprising at least about 15 contiguous nucleotides of a DNA 
sequence selected from the group consisting of SEQ ID NOS:78-86 and SEQ ID NOS:144, 
145, 153, 167, 177, 193, 199, 205, 208, 215, 217, 220. 
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NUCLEOTIDE SEQUENCE OF THE REPRESENTATIVE 
BREAST-TUMOR SPECIFIC cDNA BlSAgl 

TTA GAG ACC CAA HG G6A CCT AAT TGG GAC CCA AAT TTC TCA AGT GGA 48 
Leu Glu Thr Gin Leu Gly Pro Asn Trp Asp Pro Asn Phe Ser Sen Gly 
15 10 15 

GGG A6A ACT TTT GAC GAT TTC CAC CGG TAT CTC CTC GTG 6GT ATT CAG 96 
Gly Arg Thr Phe Asp Asp Phe His Arg Tyr Leu Leu Val Gly He Gin 
20 25 30 

GGA GCT GCC CAG AAA CCT ATA AAC HG TCT AAG GCG ATT GAA GTC GTC 144 
Gly Ala Ala Gin Lys Pro He Asn Leu Ser Lys Ala He Glu Val Val 
35 40 45 

CAG GGG CAT GAT GAG TCA CCA GGA GTG TTT TTA GAG CAC CTC CAG GAG 192 
Gin Gly His Asp Glu Ser Pro Gly Val Phe Leu Glu His Leu Gin Glu 
50 55 60 



GCT TTT AGA GAT AGC CTA AAA GGT TTT 
Ala Phe Arg Asp Ser Leu Lys Gly Phe 
115 120 



240 



GCT TAT CGG ATT TAC ACC CCT TTT GAC CTG GCA GCC CCC GAA AAT AGC 
Ala Tyr Arg He Tyr Thr Pro Phe Asp Leu Ala Ala Pro Glu Asn Ser 
65 70 75 80 

CAT GCT Cn AAT TTG GCA TTT GTG GCT CAG GCA GCC CCA GAT AGT AAA 288 
His Ala Leu Asn Leu Ala Phe Val Ala Gin Ala Ala Pro Asp Ser Lys 
85 90 95 

AG6 AAA CTC CAA AAA CTA GAG GGA TTT T6C TGG AAT GAA TAC CAG TCA 336 
Arg Lys Leu Gin Lys Leu Glu Gly Phe Cys Trp Asn Glu Tyr Gin Ser 
100 105 110 



363 



FIG. 6 
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NUCLEOTIDE SEQUENCE OF THE REPRESENTATIVE 
BREAST-TUMOR SPECIFIC cDNA B17Agl 

GGGCACAGTG GCTCATACCT GTAATCCT6A CCGTTTCAGA GGCTCAGGTG GGGGGATCGC 
TTGAGCCCAA GATTTCAAGA CTAGTaGGG TAACATAGTG AGACCCTATC TCTACGAAAA 
AATAAAAAAA TGAGCCTGGT GTAGTGGCAC ACACCAGCTG AGGAGGGA6A ATCG 



FIG, 7 
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NUCLEOTIDE SEQUENCE OF THE REPRESENTATIVE 
BREAST-TUMOR SPECIFIC cDNA B17Ag2 

TGGGG6CTCT 6ACTAGAAAT TCAAGGAACC TGGGATTCAA GTCCAACT6T GACACCAACT 
TACACTGT6G NCTCCAATAA ACTGCTTCn TCCTATTCCC TCTCTAnAA ATAAAATAAG 
GAAAACGATG TCTGTGTATA GCCAAGTCAG ffTATCCTAAA AGGAGATACT AAGTGACATT 
AAATATCAGA ATGTAAAACC TGGGAACCAG GTTCCCAGCC TGGGATTAAA CT6ACAGCAA 
GAAGACTGAA CAGTACTACT GTGAAAA6CC CGAAGNGGCA ATATGTrCAC TCTACCGTTG 
AAG6ATGGCT 6GGAGAATGA ATGCTCT6TC CCCCAGTCCC AAGCTCACn ACTATACCTC 
CTTTAT 

FIG. S 
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NUCLEOTIDE SEQUENCE OF THE REPRESENTATIVE 
BREAST-TUMOR SPECIFIC cDNA B13Ag2a 

TATAATCATG TFTCTCATTA TTTTCACATT TTATTACCAA TTTCTGnTA CCCT6AAAAA 60 

TATGAGGGAA ATATAT6AAA CAGGGAGGCA ATGHCAGAT AATTGATCAC AAGATATGAT 120 

TTCTACATCA GATGCTGTTr CCnTCCTGT TTATrTCCTT TTTATITCGG TTGTGGGGTT 180 

6AATGTAATA GCnTGTTTC AAGAGAGAGT nTGGCAGTT TCTGTAGCH CTGACACTGC 240 

TCATGTCTCC AGGCATCTAT TTGCACrTTA GGAGGTGICG TGGGA6ACTG AGAGGTCTAT 300 

TmrCCATA TTTG 314 

FIG. 9 
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NUCLEOTIDE SEQUENCE OF THE REPRESENTATIVE 
BREAST-TUMOR SPECIFIC cDNA B13Aglb 

ATACAGTC66 TTTCCATTTA TTTAACCCCC ACCTGAACGG CATAAACTGA GTGTTCAGCT 60 

GGTGTrrTTT ACT6TAAACA ATAAGGAGAC TTTGCTCTrC AnTAAACCA AAATCATATT 120 

TCATATTTTA CGCTCGAGGG TnTTACCGG TTCCTmTA CACTCCTTAA AACAGTmT 180 

AACTCGTITG 6AACAA6ATA mTTTCTTT CCT66CAGCT TrTAACATTA TAGCAAATn 240 

GTGTCTGGGG GACTGCT6GT CACTGTTrCT CACAGTTGCA AATCAAGGCA TTTGCAACCA 300 

A6AAAAAAAA AinilllGT TnATTTGAA ACT66ACCGG ATAAACGGT6 nTGGAGCGG 360 

CTGCTGTATA TAGnTTAAA TGGTTTAnG CACCTCCnA AGTTGCACTT ATGT 414 

FIG, 10 
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NUCLEOTIDE SEQUENCE OF THE REPRESENTATIVE 
BREAST-TUMOR SPECIFIC cDNA B13Agla 

TATATATTTA ATAACTTAAA TATATTTTGA TCACCCACT6 GGGTGATAA6 ACAATAGATA 60 

TAAAAGTATT TCCAAAAAGC ATAAAACCAA AGTATCATAC CAAACCAAAT TCATACTGCT 120 

TCCCCCACCC GCACTGAAAC TTCACCTTCT AACTGTCTAC CTAACCAAAT TCTACCCTTC 180 

AAGTCTHGG T6CGTGCTCA CTACTCTTTT H 1 1 1 i 1 1 1 r TTTNTTTTGG AGATGGAGTC 240 

TGGCT6TGCA GCCCA6GGGT GGAGTACAAT GGCACAACCT CAGCTCACTG NAACCTCCGC 300 

CTCCCAGGTT CAT6AGATTC TCCTGNTTCA GCCTTCCCA6 TAGCTGGGAC TACAGGTGTG 360 

CATCACCATG CCTGGNTAAT CTTTTTTN6T TTTNGGGTAG AGAT6G6GGT TTTACATGTT 420 

GGCCAGGNTG GTNTCGAACT CCTGACCTCA A6T6ATCCAC CCACCTCAG6 CTCCCAAAGT 480 

GCTAGGAHA CAGACATGAG CC 502 

FIG, 11 
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NUCLEOTIDE SEQUENCE OF THE REPRESENTATIVE 
BREAST- TUMOR SPECIFIC cDNA BllAgl 



ACATGCA6AA TATTCTATCG GTACTTCAGC 


TATTACTCAT THGAIGGCG CAATCCGAGC 


60 


CTATCCTCAA GAT6AGTATT TAGAAAGAAT 


TGATTTAGCG ATAGACCAAG CTG6TAAGCA 


120 


CTCTGACTAC ACGAAAHGT TCAGATGTGA 


TGGATTTATG ACAGTTGATC TTT6GAAGA6 


180 


ATTATTAAGT GATTATTTTA AAQG6AATCC 


ATTAATTCCA GAATATCTTG GTTrAGCTCA 


240 


AGATGATATA 6AAATA6AAC AGAAAGA6AC 


TACAAATGAA GATGTATCAC CAACTGATAT 


300 


TGAAGAGCCT ATA6TA6AAA ATGAAHAGC 


TGCATTTATT AGCCHACAC ATAGCGATTT 


360 


TCCTGATGAA TCTTATATTC AGCCATCGAC 


ATAGCAHAC CTGATGGGCA ACCCTTACGA 


420 


ATAATAGAAA CTGGGTGCGG GGCTATTGAT 


GAAnCATCC NCAGTAAATT TGGATATNAC 


480 


AAAATATAAC TCGATTGCAT 




500 




FIG. 12 
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NUCLEOTIDE SEQUENCE OF THE REPRESENTATIVE 
BREAST-TUMOR SPECIFIC cDNA B3CA3c 

ACTGATGGAT GTCGCCGGAG GCGAGG6GCC TTATCTGATG CTCGGCTGCC TGTTCGT6AT 
GTGCGCG6C6 ATTGGGCTGT TTATCTCAAA CACCGCCACG GCGGTGCTGA TGGCGCCTAT 
TGCCTTAGC6 GCGGC6AAGT CAATGG6C6T CTCACCCTAT CCTTTrGCCA TGGTG6TGGC 
6ATG6CGGCT TCGGCGGCGT TTATGACCCC GGTCTCCTCG CCGGTTAACA CCCTG6TGCT 
TGGCCCTGGC AAGTACTCAT TTAGCGATTT TGTCAAAATA GGCGTC 

FIG. 13 
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NUCLEOTIDE SEQUENCE OF THE REPRESENTATIVE 
BREAST-TUMOR SPECIFIC cDNA B9CG1 

CAGCCCCTTC TTCrCAATFT CATCTGTCAC TACCCTGGTG TAGTATCTCA TAGCCTTACA 60 

TmTATAGC CTCCTCCCTG GTCTGTCTTT TGATTTTCCT GCCTGTAATC CATATCACAC 120 

ATAACTGCAA GTAAACATTT CTAAAGTGTG GHATGCTCA TGTCACTCCT GTGCCAAGAA 180 

ATAGTTTCCA TTACCGTCTT AATAAAATTC GGATTT6TTC TTTCCTATTN TCAaCTTCA 240 

^ 241 

FIG. 14 
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NUCLEOTIDE SEQUENCE OF THE REPRESENTATIVE 
BREAST-TUMOR SPECIFIC cDNA B9CG3 

CAAA6CCAGT GGTTTGAGCT CTCTACTGTG TAAACTCCTA AACCAAGGCC ATTTATGATA 
AAT6GTG6CA GGATTTTTAT TATAAACATG TACCCATGCA AATTTCCTAT AACTCTGAGA 
TATAnCTTC TACATTTAAA CAATAAAAAT AATCTATTTT TAAAAGCCTA ATTTGCGTAG 
TTAGGTAAGA GTGTTTAATG AGAGGGTATA AGGTATAAAT CACCAGTCAA CGTTTCTCTG 



FIG. 15 
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NUCLEOTIDE SEQUENCE OF THE REPRESENTATIVE 
BREAST-TUMOR SPECIFIC cDNA B2CA2 

CGACGTCGGT AAAATCGGAC ATGAAGCCAC CGCTGGTCTT TTCGTCCGAG CGATAGGCGC 60 

C6GCCAGCCA GCGGAACGGT TGCCCG6ATG GCGAAGCGAG CCGGAGHCT TCGGACTGAG 120 

TATGAATCn GTTGTGAAAA TACTCGCCGC CnCGTrCGA CGACGTCGCG TCGAAATCn 180 

CGAACTCCTT ACGATCGAAG TCHCGTGGG C6ACGATCGC GGTCAGTTCC GCCCCACCGA 240 

AATCATGGTT GAGCCGGATG CTGCCCCCGA AGCCCT 276 

FIG. 16 
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NUCLEOTIDE SEQUENCE OF THE REPRESENTATIVE 
BREAST-TUMOR SPECIFIC cDNA B3CA1 

CCCAGGTCAA CCAGGCTGCA ACACGCAGGT CCnGGATTG G6CACGAAGC AGCGCTTCGC 
TGrmCCAG GATTTTCAAC CAGTCGGTCT GGCCGTTCTC ATGGAGC6AG AGCGCCTTGC 
CCAGCTCATT TTCCAGCGCC TCGTATTCGC TGGAAAAACG CACATCCTCA CCCGCAAAGA 
CATCCTTTGA AATCGGCTGT TCCGCGAGH CCA6ATANTG CGAGGAGAGC TTGCTCGAAT 
AGGTCATCCT AACCCHCAA TGCACACCAT GTGCGCCAAT GAATATCTTA ACAATTCAAC 
TAGTTGGCAT AANAACCGAA C6AAAATCCC AATA6TCTGA AGAGCTCTTT TG 

FIG. 17 
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NUCLEOTIDE SEQUENCE OF THE REPRESENTATIVE 
BREAST-TUMOR SPECIFIC cDNA B3CA2 

CT6CAT6TCC AC6GCCTGGA TrTACGGGTG GTCGGCGTTC ACCCCTGGCA GCTGGCGCTC 60 

TTCCCGACCA GGCCCAGCAG 6ATGTGTGGG GCAAGGATAA CGGCGTGCGC ATCGCCTCGA 120 

CCTATATGCC TACTGGCAAG GCCGAGCCCG TGGAAGGCGG ATTCAGGTTC ANCGGTCGCT 180 

GGAGCrnnrC CACCGGCTCC ATGCATTGTG ACTGGCTGTT TCTAGGCGGT CTGTTGCCCA 240 

AGCGTGATGG TACGTCTGGC CTGGAGCATG TGACTTTCTG 280 

FIG. 18 
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NUCLEOTIDE SEQUENCE OF THE REPRESENTATIVE 
BREAST-TUMOR SPECIFIC cDNA B3CA3 

AGCAAGGAGA AGGCCAAGGA GAGGCTCAAG CTG6TCCTGG CCTACGACTG GGCCAAGCTG 60 

TCGCCGQGGA TGGTGGAGAA CCTGAAGCGG 6ACCTCCTCG AGGTCCTCCG CCGCTACTTC 120 

TCCGTCCAGG AGGAGGGTCT TTCCGTGGTC HGGAGGAGC GGGGGGA6AA GATNCTCCTC 180 

ATGGTCNACA TCCCCCTCAN GTGATG6TCC TGANGNGNCC CNTCCTCCTT GNCTACGAH 240 

TCGGNCTGGT GGCCCTNTTT CT 262 

FIG, 19 
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NUCLEOTIDE SEQUENCE OF THE REPRESENTATIVE 
BREAST-TUMOR SPECIFIC cDNA B4CA1 

AGGAGCGGGT AGAGTG6CAC CATTGAGGGG ATATTCAAAA ATATTATTrT GTCCTAAATG 60 

ATAGHGCTG AGmTTCTT TGACCCATGA GHATAnGG AGTTrATnT TTAACmCC 120 

AATCGCATGG ACATGTTAGA CTTATTTTCT GTTAATGATT NCTATTTITA TTAAATTGGA 180 

TTTGA6AAAT TGGTTNnAT TATATCAAH TTrGGTATTT GTTGAGTrTG ACAHATAGC 240 

TTAGTATGT 249 

FIG. 20 
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